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Abstract. We follow a stream of the history of positive matrices and 
positive functionals, as applied to algebraic sums of squares decomposi- 
tions, with emphasis on the interaction between classical moment prob- 
lems, function theory of one or several complex variables and modern 
operator theory. The second part of the survey focuses on recently dis- 
covered connections between real algebraic geometry and optimization 
as well as polynomials in matrix variables and some control theory prob- 
lems. These new applications have prompted a series of recent studies 
devoted to the structure of positivity and convexity in a free *-algebra, 
the appropriate setting for analyzing inequalities on polynomials having 
matrix variables. We sketch some of these developments, add to them 
and comment on the rapidly growing literature. 



1. Introduction 

This is an essay, addressed to non-experts, on the structure of positive 
polynomials on semi-algebraic sets, various facets of the spectral theorem for 
Hilbert space operators, inequalities and sharp constraints for elements of a 
free *— algebra, and some recent applications of all of these to polynomial 
optimization and engineering. The circle of ideas exposed below is becoming 
increasingly popular but not known in detail outside the traditional groups 
of workers in functional analysis or real algebra who have developed parts 
of it. For instance, it is not yet clear how to teach and facilitate the access 
of beginners to this beautiful emerging field. The exposition of topics below 
may provide elementary ingredients for such a course. 

Partially supported by grants from the National Science Foundation and the Ford 
Motor Co. 
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The unifying concept behind all the apparently diverging topics men- 
tioned above is the fact that universal positive functions (in appropriate 
rings) are sums of squares. Indeed, when we prove inequalities we essen- 
tially complete squares, and on the other hand when we do spectral analysis 
we decompose a symmetric or a hermitian form into a weighted (possibly 
continuous) sum or difference of squares. There are of course technical diffi- 
culties on each side, but they do not obscure the common root of algebraic 
versus analytical positivity. 

We will encounter quite a few positivity criteria, expressed in terms of: 
matrices, kernels, forms, values of functions, parameters of continued frac- 
tions, asymptotic expansions and algebraic certificates. Dual to sums of 
squares and the main positive objects we study are the power moments 
of positive measures, rapidly decaying at infinity. These moments will be 
regarded as discrete data given by fixed coordinate frames in the correspon- 
dence between an algebra (of polynomials or operators) and its spectrum, 
with restrictions on its location. Both concepts of real spectrum (in algebraic 
geometry) and joint spectrum (in operator theory) are naturally connected 
in this way to moment problems. From the practitioner's point of view, mo- 
ments represent observable/computable numerical manifestations of more 
complicated entities. 

It is not a coincidence that the genius of Hilbert presides over all aspects of 
positivity we will touch. We owe him the origins and basic concepts related 
to: the spectral theorem, real algebra, algebraic geometry and mathematical 
logic. As ubiquitous as it is, a Hilbert space will show up unexpectedly and 
necessarily in the proofs of certain purely algebraic statements. On the other 
hand our limited survey does not aim at offering a comprehensive picture of 
Hilbert's much wider legacy. 

Not unexpected, or, better later than never, the real algebraist's positivity 
and the classical analyst's positive definiteness have recently merged into a 
powerful framework; this is needed and shaped by several applied fields of 
mathematics. We will bring into our discussion one principal customer: 
control theory. The dominant development in linear systems engineering in 
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the 1990's was matrix inequalities and many tricks and ad hoc techniques 
for making complicated matrix expressions into tame ones, indeed into the 
Linear Matrix Inequalities, LMIs, loved by all who can obtain them. Since 
matrices do not commute a large portion of the subject could be viewed as 
manipulation of polynomials and rational functions of non-commuting (free) 
variables, and so a beginning toward helpful mathematical theory would be 
a semi-algebraic geometry for free *-algebras, especially its implications for 
convexity. Such ventures sprung to life within the last five years and this 
article attempts to introduce, survey and fill in some gaps in this rapidly 
expanding area of noncommutative semi-algebraic geometry. 

The table of contents offers an idea of the topics we touch in the survey 
and what we left outside. We are well aware that in a limited space while 
viewing a wide angle, as captives of our background and preferences, we 
have omitted key aspects. We apologize in advance for all our omissions in 
this territory, and for inaccuracies when stepping on outer domains; they are 
all non-intentional and reflect our limitations. Fortunately, the reader will 
have the choice of expanding and complementing our article with several 
recent excellent surveys and monographs (mentioned throughout the text 
and some recapitulated in the last section). 

The authors thank the American Institute of Mathematics, Palo Alto, 
CA, for the unique opportunity (during a 2005 workshop) to interact with 
several key contributors to the recent theory of positive polynomials. They 
also thank the organizers of the "Real Algebra Fest, 2005", University of the 
Saskatchewan, Canada, for their interest and enthusiasm. The second author 
thanks the Real Algebra Group at the University of Konstanz, Germany, 
for offering him the possibility to expose and discuss the first sections of the 
material presented below. 

We dedicate these pages to Tibi Constantinescu, old time friend and col- 
league, master of all aspects of matrix positivity[j] 

1 Advice to the reader. Although the ordering of the material below follows a logic 
derived from a general theoretical and historical perspective, most of the sections, and 
sometimes even subsections, can be read independently. We have tried to keep to a mini- 
mum the number of cross references and we have repeated definitions. 
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2. The spectral theorem 

The modern proof of the spectral theorem for self-adjoint or unitary op- 
erators uses commutative Banach algebra techniques, cf. for instance [D03]. 
This perspective departs from the older, and more constructive approach 
imposed by the original study of special classes of integral operators. In this 
direction, we reproduce below an early idea of F. Riesz [R13] for defining 
the spectral scale of a self-adjoint operator from a minimal set of simple 
observations, one of them being the structure of positive polynomials on a 
real interval. 

2.1. Self-adjoint operators. Let H be a separable, complex Hilbert space 
and let A £ £{H) be a linear, continuous operator acting on H. We call 
A self-adjoint if A = A*, that is (Ax,x) G M for all vectors x £ H. The 
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continuity assumption implies the existence of bounds 

(2.1) m\\x\\ 2 < {Ax } x) < M\\x\\ 2 , x € H . 

The operator A is called non-negative, denoted in short A > 0, if 

(Ax, x) > 0, x € H. 

The operator A is positive if it is non-negative and ({Ax, x) = 0) => (x = 0). 

We need a couple of basic observations, see §104 of [RN90] . The real 
algebraists should enjoy comparing these facts with the axioms of an order 
in an arbitrary ring. 

a) . A bounded monotonic sequence of self- adjoint operators converges (in 
the strong operator topology) to a self-adjoint operator. 

Indeed, assume < Ai < A2 < ... < / and take B = A n+ k — A n for some 
fixed values of n, k £ N. Observe that < B < I, so Cauchy-Schwarz' 
inequality holds for the bilinear form (Bx,y). Use this to get: (Bx,Bx) 2 < 
(Bx, x)(B 2 x, Bx) < (Bx,x)(Bx,Bx) , from which 

\\Bx\\ 2 = (Bx,Bx) < (Bx,x) 

Thus, for every vector x G H: 

\\A n -\-kX — A n x\\ 2 < (An+kX, x) — (A n x, x) . 

Since the sequence (A n x, x) is bounded and monotonic, it has a limit. Hence 
lim n A n x exists for every x £ H, which proves the statement. 

b) . Every non-negative operator A admits a unique non-negative square 
root \fA: (\TA) 2 = A. 

For the proof one can normalize A, so that < A < I and use a convergent 
series decomposition for \fx = wl — (1 — x), in conjunction with the above 
remark. See for details §104 of [RN90]. 
Conversely, if T € L(H), then T*T > 0. 
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c). Let A,B be two commuting non-negative (linear bounded) operators. 
Then AB is also non-negative. 

Note that, if AB = BA, the above proof implies \f~BA = A^/B. For the 
proof we compute directly 

(ABx,x) = (AVbVBx,x) = 
(VBAVBx,x) = (AVBx,VBx) > 0. 

With the above observations we can enhance the polynomial functional 
calculus of a self-adjoint operator. Let C[i],M[i] denote the algebra of poly- 
nomials with complex, respectively real, coefficients in one variable and let 
A = A* be a self-adjoint operator with bounds (|2.ip . The expression p{A) 
makes sense for every p G C[t], and the polynomial functional calculus for 
A which is the map (f> 

p A p(A) 

is obviously linear, multiplicative and unital (1 maps to I). Less obvious is 
the key fact that that (f> is positivity preserving: 

Proposition 2.1. // the polynomial p £ M.[t] satisfies p(t) > for all t 
in [m, M] and the self-adjoint operator A satisfies ml < A < MI, then 
p(A) > 0. 

Proof. A decomposition of the real polynomial p into irreducible, real 
factors yields: 

P (t) =cH(t- ai ) niPj - 1) nut ~ lk f + 

i j k 

with c > 0, ai < m < M < Pj and jk G ^> $k £ According to the 
observation c) above, we find p(A) > 0. ■ 
The proposition immediately implies 
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Corollary 2.2. The homomorphism <p on C[t] extends to C[m,M] and be- 
yond. Moreover, 

\\p(A)\\ < sup \p\ =: HpHoo. 

[m,M] 

Proof. The inequality follows because supr mM ] \p\ ±p is a polynomial non- 
negative on [m, M], so ||p|| 00 i' > ±p(A) which gives the required inequality. 
Thus 4> is SU P norm continuous and extends by continuity to the completion 
of the polynomials, which is of course the algebra C[m, M] of the continuous 
functions. 

The Spectral Theorem immediately follows. 

Theorem 2.3. If the self adjoint bounded operator A on H has a cyclic 
vector £, then there is a positive Borel measure n on [m, M] and a unitary 
operator U : H ^ L 2 {^) identifying H with L 2 ([i) such that 

UAU* = M x . 

Here for any g in L°° the multiplication operator M g is defined by M g f = gf 
on all f G L 2 (/j,). 

The vector £ cyclic means 

span {A k £ i : k = 0, 1, 2. • • • } = {p(A)S i : p a polynomial } 
is dense in H. 

Proof Define a linear functional L : C([m, M]) ^ C by 

L(f) := (f(A)t, for all f G C([m, M}). 

The Representation Theorem (see Proposition 13.21 for more detail) for such 
L says there is a Borel measure a such that 

L(f) = [ fdn- 

J[m,M] 

moreover, n is a positive measure because if / > on [to, M], then L(f) > 0. 
A critical feature is 

(2.2) [pqdu=(p(A)a, q (A)0 
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which holds, since = L{pq) = (p(A)q(A)( t , £). We have built our representing 
space (using a formula which haunts the rest of this paper) and now we 
identify H with this space. 

Define U by Up(A)£ = p which specifies it on a dense set (by the cyclic as- 
sumption) provided Up\{A)^ = Up2(A)t; implies e(A)£ := pi(A)^—p2(A)^ = 
0; in other words, = (e(A)£, (?(A)£) for all polynomials q. Thus = / eqdfj,, 
so e = a.e. wrt [i. Now to properties of U: 

(1) U is isometric. (That is what (|2.2p says.) Thus U extends to H and 
has closed range. 

(2) The range of U is dense since it contains the polynomials. 

(3) U Ap{A)^ = xp{x) = xUp(A)£ for all polynomials p. By the density 
imposed by cyclicity for any v in H we have 



Note the constrction gives U£ = 1. ■ 

2.2. A bigger functional calculus and spectral measures. Our next 
aim is to consider a bounded, increasing sequence p n of real polynomial 
functions on the interval [m, M] and define, according to observation a): 



where / is a point-wise limit of p n . A standard argument shows that, if q n 
is another sequence of polynomials, monotonically converging on [m, M] to 
/, then 



See for details §106 of [RN90]. The new calculus / i— ► f(A) remains linear 
and multiplicative. 

In particular, we can apply the above definition to the step functions 



UAv = MMv. 



f(A)x = limp n (A)x, x £ H, 



lim q n (A) x = limp n ( A) x, x G H. 



n 




This yields a monotonic, operator valued function 



F A (s) = Xs (A) 



10 J. WILLIAM HELTON AND MIHAI PUTIN AR 

with the additional properties Fa(s) = Fa(s)* = Fa(s) 2 and 

f,M = ( °' e<m - 

{ I, s>M. 

With the aid of this spectral scale one can interpret the functional calculus 
as an operator valued Riemann-Stieltjes integral 

f(A) = / f(t)dF A (t). 

The spectral measure Ea of A is the operator valued measure associated to 
the monotonic function Fa, that is, after extending the integral to Borel 
sets a, 

E A (a) = [ dF A {t). 

Thus Ea(cf) is a family of mutually commuting orthogonal projections, sub- 
ject to the multiplicativity constraint 

E A (anT) = E A (a)E A (T). 

As a matter of notation, we have then for every bounded, Borel measurable 
function /: 

rM 

(2.3) f(A) = / f(t)E A (dt). 

J m 

This is a form of the Spectral Theorem which does not assume cyclicity. 

A good exercise for the reader is to identify the above objects in the case of 
a finite dimensional Hilbert space H and a self-adjoint linear transformation 
A acting on it. A typical infinite dimensional example will be discussed later 
in connection with the moment problem. 

2.3. Unitary operators. The spectral theorem for a unitary transforma- 
tion U £ L(H), U*U = UU* = I, can be derived in a very similar manner. 

The needed structure of positive polynomials is contained in the following 
classical result. 



Lemma 2.4 (Riesz-Fejer). A non-negative trigonometric polynomial is the 
modulus square of a trigonometric polynomial. 
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Proof. Let p(e ie ) = Ytd ^ and assume that p{e ie ) > 0, 9 G [0,2vr]. 
Then necessarily c-j = cj. By passing to complex coordinates, the rational 
function p{z) = Yl^d ]^ mus t be identical to p(l/z). That is its zeros and 
poles are symmetrical (in the sense of Schwarz) with respect to the unit 
circle. 

Write z d p{z) = q(z), so that q is a polynomial of degree 2d. One finds, in 
view of the mentioned symmetry: 

q{z) = cz v JJO - Xj) 2 Y[(z - Hk)(z ~ 1 /Wk), 

j k 

where c ^ is a constant, \Xj\ = 1 and < \/J,k\ < 1. 
For z = e td we obtain 

p{e w ) = \p{e w )\ = \q{e w \ = 
( , . . ,J0 \ |2 TT l e 



ni^-^i 2 n 



Returning to the unitary operator U we infer, for p G C[z], 

^(e ie ) > => Kp(C/) > 0. 

Indeed, according to the above Lemma, ^R.p(e t9 ) = \q(e ie )\ 2 , whence 

Mp(U) = q(U)*q(U) > 0. 

Then, exactly as in the preceding section one constructs the spectral scale 
and spectral measure of U. 

For an operator T we denote its "real part" and "imaginary part" by 
SRT = (T + T*)/2 and = (T-T*)/2i. 

The reader will find other elementary facts (a la Riesz-Fejer's Lemma) 
about the decompositions of non-negative polynomials into sums of squares 
in the second volume of Polya and Szego's problem book |PS25j . This par- 
ticular collection of observations about positive polynomials reflects, from 
the mathematical analyst point of view, the importance of the subject in 
the first two decades of the XX-th century. 
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2.4. Riesz-Herglotz formula. The practitioners of spectral analysis know 
that the strength and beauty of the spectral theorem lies in the effective 
dictionary it establishes between matrices, measures and analytic functions. 
In the particular case of unitary operators, these correspondences also go 
back to F. Riesz. The classical Riesz-Herglotz formula is incorporated below 
in a more general statement. To keep the spirit of positivity of the last 
sections, we are interested below in the additive (rather than multiplicative) 
structure of polynomials (or more general functions) satisfying Riesz-Fejer's 
condition: 

$lp(z) > 0, \z\ < 1. 

We denote by D the unit disk in the complex plane. Given a set X by a 
positive semi-definite kernel we mean a function K : X x X — ► C satisfying 

N 

^ K(xi,Xj)ciCj > 0, 

for every finite selection of points x\, xn € X and complex scalars c±, cat. 

Theorem 2.5. Let f : D — ► C be an analytic function. The following 
statements are equivalent: 

a) . > 0, z£D, 

b) . (Riesz-Herglotz formula). There exists a positive Borel measure (i on 
[— 7T, 7r] and a real constant C, such that: 

f{z)=iC+ ^±ld»(t), zGB, 

J -TV e Z 

c) . The kernel K/:DxD — > C, 

K f (z, w ) = f{z)+ W\ z,we®, 
1 — zw 

is positive semi- definite, 
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d). There exists a unitary operator U G C{H), a vector £ G H and a 
constant a G C, 3fta > 0, such that: 

f(z)=a + z((U-z)- 1 Z,0, zGD. 

Proof. We merely sketch the main ideas in the proof. The reader can 
consult for details the monograph [AM02J. 

a) => b). Let r < 1. As a consequence of Cauchy's formula: 

1 pit rp it _j_ „ 

f(z) = i3f/(0) + - / — j— Kf(re lt )dt, \z\ < r. 
Z7r J_ n re lt — z 

Since the positive measures ^?R.f(re lt )dt have constant mass on [— tt,tt]: 

J* W(re lt )dt = »/(0), r < 1, 

they form a weak—* relatively compact family (in the space of finite mea- 
sure). Any weak—* limit will satisfy the identity in b) (hence all limit points 
coincide) . 

b) => c). A direct computation yields: 

(2.4) K f (z, W ) = £ A ^_ z) 2 {e _ it __ ) d ,(t), ,«B. 

Since for a fixed value of t, the integrand is positive semi-definite, and we 
average over a positive measure, the whole kernel will turn out to be positive 
semi-definite. 

c) => a). Follows by evaluating Kf on the diagonal: 

2»/(s) = (1 - |z| 2 )K/(z,z) > 0. 

6) d). Let H = L 2 (n) and Uf(t) = e u f(t). Then 17 is a unitary 
operator, and the constant function £ = \/2 yields the representation d). 

d) =^ 6). In view of the spectral theorem, we can evaluate the spectral 
measure Ejj on the vector £ and obtain a positive measure \x satisfying: 

/(*) = a + z((U - z)-\, 0=a + z T = 

i r n e u + z i r 
a+ 2j ^rr~ z d ^ - 2 y z G D - 

By identifying the constants we obtain, up to the factor 2, conclusion b). ■ 
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The theorem above has far reaching consequences in quite divergent direc- 
tions: function theory, operator theory and control theory of linear systems, 
see for instance [AM021 IFF901 MM [RR97] . We confine ourselves to describe 
only a generic consequence. 

First, we recall that, exactly as in the case of finite matrices, a positive 
semi-definite kernel can be written as a sum of squares. Indeed, if K : 
X x X — ► C is positive semi-definite, one can define a sesqui-linear form 
on the vector space (BxexC, with basis e(x),x E X, by 



This is a positive semi-definite inner product. The associated separated (i.e. 
Hausdorff) Hilbert space completion H carries the classes of the vectors 
[e(x)] G H. They factor K into a sum of squares: 



where (/&) is any orthonormal basis of H. For details, see for instance the 
Appendix to |RN90j . 

The following result represents the quintessential bounded analytic inter- 
polation theorem. 

Theorem 2.6 (Nevanlinna-Pick) . Let {aj E D; % G 1} be a set of points in 
the unit disk, and let G C; > 0, i E 1} be a collection of points in 
the right half-plane, indexed over the same set. 

There exists an analytic function f in the unit disk, with > 0, \z\ < 

1, and /(oj) = Cj, i£ /, if and only if the kernel 



N 




K(x,y) = <[e(x)], [c(y)]> = £>(x)], /*></*, [efo)]) 



Q H~ 



1 — ajd,- 



zs positive semi- definite. 



Proof. Point c) in the preceding Theorem shows that the condition is 
necessary. 
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A Moebius transform in the range (/ i— > g = (/ — l)/(/ + 1)) will change 
the statement into: 



if and only if the kernel 



1 — didj 
1 — a-ioj 



g(cii) = di, 

i,j e I, 



is positive semi-definite. 

To prove that the condition in the statement is also sufficient, assume 
that the latter kernel is positive semi-definite. As before, factor it (into a 
sum of squares): 

1 — OjOj 

where h : / — ► H is a function with values in an auxiliary Hilbert space H. 
Then 

1 + {aMij^ajhQ)) = d { dj + (h(i),h(j)), i,j G I. 

The preceding identity can be interpreted as an equality between scalar 
products in C © H: 





) = ( ' u >. 



Let Hi C C © H be the linear span of the vectors (1, aih{i)) T , i € I. The 
map 





extends then by linearity to an isometric transformation V : Hi — > H. 
Since the linear isometry V can be extended (for instance by zero on the 
orthogonal complement of Hi) to a contractive linear operator T :C®H — ► 
C © H, we obtain a block matrix decomposition of T satisfying: 



1 ) = di 

aih(i) J \ h(i) 



A B 
C D 

Since ||D|| < 1, the operator / — zD is invertible for all z G D. From the 
above equations we find, after identifying A with a scalar: 

h(i) = (I - OiD) _1 Cl, di = A + ai Bh(i). 
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We define the analytic function 

g(z) = A + zB(I - zD)- l Cl, \z\ < 1. 

It satisfies, as requested: g{ai) = di, i £ I. 

By reversing the above reasoning we infer, with h(z) = (I—zD)^ 1 Cl E H: 




\\ g (z)r + \\Hz)r < i + \\zh( Z )r < 1 + u^r, \ z \ < 1, 

whence 

\g(z)\ < 1, \z\ < 1. 



The above proof contains the germ of what experts in control theory call 
"realization theory" . For the present survey it is illustrative as a constructive 
link between matrices and analytic functions with bounds; it will also be 
useful as a model to follow in more general, non-commutative settings. 

A great deal of research was done in the last two decades on analogs 
of Riesz-Herglotz type formulas in several complex variables. As expected, 
when generalizing to C n , there are complications and surprises on the road. 
See for instance [AM021 IBT98j ICW99j IEP02] and in several non-commuting 
variables [BGM051 IK05]. We will return to some of these topics from the 
perspective of positive polynomials and moment sequences. 

2.5. von Neumann's inequality. We have just seen that the heart of the 
spectral theorem for self-adjoint or unitary operators was the positivity of 
the polynomial functional calculus. A surprisingly general inequality, of the 
same type, applicable to an arbitrary bounded operator, was discovered by 
von Neumann [vN2j. 

Theorem 2.7. Let T £ £(H), \\T\\ < 1, be a contractive operator. If a 
polynomial p G C[z] satisfies "Sip(z) > 0, z£l, then ^kp{T) > 0. 
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Proof. According to Riesz-Herglotz formula we can write 

rn e it _|_ z 

p(z) =iC+ — r —dn{t), \z\ < 1, 

J -7T e Z 

where CsR and fj, is a positive measure. 

Fix r < 1, close to 1, and evaluate the above representation at z = rT: 



p(rT) = iC + / (e u + rT)(e u - rT)- 1 dfi(t) . 

J —n 

Therefore 

p{rT) + p(rT)* = 

r(e i *-rr)- 1 [(e i *+rr)(e- i '-rT*) + (e i '-rr)(e- i '+rr*)](e- i *-rT*)- 1 (i^(t) 

J — TT 

2 r (e u - rT)- 1 ^ - r 2 TT*](e- u - rT*)' 1 dfj,(t) > 0. 
Letting r^lwe find &p(T) > 0. ■ 

A Moebius transform argument, as in the proof of Nevanlinna-Pick The- 
orem, yields the equivalent statement (for a contractive linear operator T): 

(\p(z)\<l, \z\<l) b(T)||<l. 

Von Neumann's original proof relied on the continued fraction structure 
of the analytic functions from the disk to the disk. The recursive construc- 
tion of the continued fraction goes back to Schur [S18] and can be explained 
in a few lines. 

Schur's algorithm. Let / : D — * D be an analytic function. Then, in 
view of Schwarz Lemma, there exists an analytic function fx : D — > D with 
the property: 

/w - /(0) -,*w, 



i - /(<>)/(*: 

or equivalently, writing so = /(0): 

SQ + Zfl{z) 



1 + s zfi(z) 

In its turn, 

si + zf 2 (z) 
l + sTzf 2 (z) ' 



f(z) 
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with an analytic f% : D — ► D, and so on. 

This algorithm terminates after finitely many iterations for finite Blashcke 
products 

t=i 1 - x * z 

Its importance lies in the fact that the finite section of Schur parameters 
(sq, si, s n ) depends via universal expressions on the first section (same 
number) of Taylor coefficients of / at z = 0. Thus, the conditions 

\so(c )\ < 1, |si(co,ci)| < 1, . . . 

characterize which power series 

C + C\Z + c 2 z 2 + 

are associated to analytic functions from the disk to the disk. For details 
and a variety of applications, see |Constantinescu96l IFF9CH |RR97j. 

One notable application is to solve the classical Caratheodory-Fejer in- 
terpolation problem, a close relative of the Nevanlinna-Pick problem we 
presented earlier. Here one specifies complex numbers cq, • • ■ , c m and seeks 
/ : B — > D analytic for which 

The Schur Algorithm constructs such a function and in the same time gives 
a simple criterion when the solution exists. Alternatively, a special type of 
matrix (c n _ m )™ m=0 , with zero entries under the diagonal (cj = 0, j < 0), 
called a Toeplitz matrix, based on cq, ■ ■ ■ , c m is a contraction if and only if 
a solution to the Caratheodory-Fejer problem exists. A version of this fact 
in the right half plane (rather than the disk) is proved in Theorem 13.31 

As another application, we can derive (also following Schur) an effective 
criterion for deciding whether a polynomial has all roots inside the unit disk. 
Let 

p(z) = c d z d + c d -iz d - 1 + ... + c G C[z], 

and define 

p\z) = z d WF) = + ciz d ~ l + ... + Cd- 
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It is clear that 



\p{j t )\ = \p\j% t€[-7r,7r] 



and that the roots of p are symmetric with respect to the unit circle to the 
roots of p. Therefore, p has all roots contained in the open unit disk if and 
only if ^ is an analytic function from the disk to the disk, that is, if and 
only if the kernel 



is positive definite. As a matter of fact is a finite Blashcke product, and 
Schur's algorithm terminates in this case after finitely many iterations. 

In general, regarded as a Hermitian form, evaluated to the variables Zi = 
z' 1 , < i < d, the signature of the above kernel (that is the number of zeros, 
negative and positive squares in its canonical decomposition) counts how 
many roots the polynomial p has inside the disk, and on its boundary. For 
many more details see the beautiful survey [KN81| . 



In this section we return to Hilbert space and the spectral theorem, by 
unifying the analysis and algebra concepts we have discussed in the previous 
sections. This is done in the context of power moment problems, one of the 
oldest and still lively sources of questions and inspiration in mathematical 
analysis. 

As before, x = {x\, ...,x g ) stands for the coordinates in M 9 , and, at the 
same time, for a tuple of commuting indeterminates. We adopt the multi- 
index notation x a = x" 1 ...Xg B , a G N 5 . Let \i be a positive, rapidly decreas- 
ing measure on M 9 . The moments of jj, are the real numbers: 



For its theoretical importance and wide range of applications, the correspon- 
dence 



p^(z)p^(w) — p(z)p(w) 



z,w € D, 



1 — zw 



3. Moment problems 




{fi; positive measure} 



{(a a ); moment sequence} 
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can be put on an equal level with the Fourier-Laplace, Radon or wavelet 
transforms. It is the positivity of the original measure which makes the 
analysis of this category of moment problems interesting and non-trivial, 
and appropriate for our discussion. For general aspects and applications 
of moment problems (not treated below) the reader can consult the mono- 
graphs |Akh65l IBCR981 IFF901 IST43] and the excellent survey [F83]. The old 
article of Marcel Riesz jMR23^ remains unsurpassed for the classical aspects 
of the one variable theory. 

Given a multi-sequence of real numbers (a a ) ae ^a a linear functional rep- 
resenting the potential integral of polynomials can be defined as: 

L : R[x] — ► M, L(x a ) =a a , a £ N 9 , 

and vice-versa. When necessary we will complexify L to a complex linear 
functional on C[x\. 

if M Q £N9 are the moments of a positive measure, then for a polynomial 
p € M[x] we have 

L(p 2 ) = / p 2 dfi > 0. 

Moreover, in the above positivity there is more structure: we can define on 
C[x] a pre-Hilbert space bracket by: 

(P,Q) = L(pq), p,q E C[x]. 

The inner product is positive semi-definite, hence the Cauchy-Schwarz in- 
equality holds: 

IMI 2 <IHI 2 lkll 2 - 

Thus, the set of null- vectors N = {p G C[x]; \\p\\ = 0} is a linear subspace, 
invariant under the multiplication by any polynomial. Let H be the Hilbert 
space completion of C[x]/N with respect to the induced Hermitian form. 
Let T> = C[x]/N be the image of the polynomial algebra in H. It is a dense 
linear subspace, carrying the multiplication operators: 



M Xt : V — ► V, M Xi p = Xi p. 



POSITIVE POLYNOMIALS AND THE SPECTRAL THEOREM 

Note that these are well defined, symmetric linear operators: 



21 



(M Xi p, q) = L(xipq) = (p, M Xi q), p,qeV, 



and they commute 



M X .M X . = M X .M, 



Finally the (constant function) vector £ = 1 is cyclic, in the sense that V is 
the linear span of repeated actions of M X1 , M Xg on £: 



We collect these observations into a single statement. 

Proposition 3.1. There is a bijective correspondence between all linear 
Junctionals 



and the pairs (M, £) of g -tuples M = (M±, M g ) of commuting, symmetric 
linear operators with a cyclic vector £ (acting on a separable Hilbert space). 
The correspondence is given by the relation 



Above the word commuting has to be taken with caution: implicitly it is 
understood that we define the span V as before, and remark that every Mi 
leaves T> invariant. Then Mj commutes with Mj as endomorphisms of V. 

Having a positive measure pi represent the functional L adds in general 
new constraints in this dictionary. 

Let V+(K) be the set of all polynomials which are non- negative on the 
set K CR 9 and note that this is a convex cone. 

Proposition 3.2. A linear functional L 6 M[x]' is representable by a posi- 
tive measure [i: 



V= \/ M£..JW?/£. 



L G R[x]', L| S 2 M[a;] > 



L(p) = (p(M)Z,0, peR[x}. 




if and only if L\ P+(Rg ) > 0. 
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Although this observation (in several variables) is attributed to Haviland, 
see |Akh65| , it is implicitly contained in Marcel Riesz article [MR23] . Again 
we see exactly the gap 

S 2 M[x] C V+{R 9 ), 

which we must understand in order to characterize the moments of positive 
measures (as already outlined in Minkowski's and Hilbert's early works). 

Proof. If the functional L is represented by a positive measure, then it is 
obviously non-negative on all non-negative polynomials. 

To prove the converse, assume that L\p + me) > 0. Let C p Bd(^ 9 ) be the 
space of continuous functions / having a polynomial bound at infinity: 

\f(x)\<C(l + \x\) N , 

with the constants C, N > depending on /. We will extend L, following 
M. Riesz [MR23J, to a non-negative functional on C p Bd(M 9 )- 

This extension process, parallel and arguably prior to the Hahn-Banach 
Theorem, works as follows. Assume that 

L : V — > R 

is a positive extension of L to a vector subspace V C C p Bd(^ 9 )- That is: 

(h g V, h > 0) => (L(h) > 0). 

Remark that L is defined on all polynomial functions. Assume V is not the 
whole space and choose a non-zero function / G C p Bd(R 9 ) \ V ■ Since / has 
polynomial growth, there are elements hi, hi G V satisfying 

hi<f< h 2 . 

By the positivity of L, we see Lhi < Lf < Lhi, that is 

sup L(h\) < inf L(hz). 

hi<f f< h 'i 

Choose any real number c between these limits and define 
L'[h + A/) = L[h) + Ac, h G V, A G R. 
This will be a positive extension of L to the larger space V © Rf- 
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By a standard application of Zorn's Lemma, we find a positive extension 
of L to the whole space. Finally, F. Riesz Representation Theorem provides 
a positive measure [i on ~R g , such that L(p) = f pdfi, p £ M[x]. ■ 

Next we focus on a few particular contexts (either low dimensions, or 
special supporting sets for the measure) where the structure of the posi- 
tive functionals and tuples of operators appearing in our dictionary can be 
further understood. 

3.1. The trigonometric moment problem. We specialize to dimension 
n = 2 and to measures supported on the unit circle (torus) T = {z G 
C; \z\ = 1}. The group structure of T identifies our moment problem to 
the Fourier transform. It is convenient in this case to work with complex 
coordinates z = x + iy £C = M?, and complex valued polynomials. In 
general, we denote by £^C[x] the sums of moduli squares (i.e. \q\ 2 ) of 
complex coefficient polynomials. 

The ring of regular functions on the torus is 



where (1 — zz) denotes the ideal generated by 1 — zz. A non-negative linear 
functional L on T^A necessarily satisfies 



Hence L is determined by the complex moments L(z n ), n > 0. The following 
result gives a satisfactory solution to the trigonometric moment problem on 
the one dimensional torus. 

Theorem 3.3. Let (cn)^^^ be a sequence of complex numbers subject to 
the conditions cq > 0, c_ n = t~H, n > 0. The following assertions are 
equivalent: 

a). There exists a unique positive measure ji on T, such that: 



A = C[z,z]/(1 - zz) = C[z]®zC[z], 



L(f) = L(f), f€A. 




2-1 
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b) . The Toeplitz matrix (c n _ m )5f m=0 is positive semi- definite; 

c) . There exists an analytic function F : D — ► C, ?R.F > 0, such that 

oo 

F{z) =c + 2^2c- k z k , \z\<l; 
k=l 

d) . There exists a unitary operator U G L(H) and a vector £ 6 H cyclic 
for the pair (U,U*), such that 

(U n Z,0=c n , n>0. 

Proof. Let L : C[z,z]/(1 — zz) — ► C be the linear functional denned by 

L(z n ) =c n , n> 0. 

Condition b) is equivalent to 

L{\p\ 2 ) > 0, pGC[z,z]/(l-zz). 

Indeed, assume that p(z) = Y^j=o a j zJ ■ Then, since zz = 1, 

9 

\p(z)\ 2 = Yl a j®k zi ~ k > 

j,k=0 

whence 

g 

L i\p\ 2 ) = Y a jUk c j-k- 

j,k=0 

Thus a) =^ b) trivially. In view of the Riesz-Fejer Lemma, the functional 
L is non-negative on all non-negative polynomial functions on the torus. 
Hence, in view of Proposition 13.21 it is represented by a positive measure. 
The uniqueness is assured by the compactness of T and Stone- Weierstrass 
Theorem (trigonometric polynomials are uniformly dense in the space of 
continuous functions on T). The rest follows from Theorem 12.51 ■ 
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Notable in the above Theorem is the fact that the main objects are in 
bijective, and constructive, correspondence established essentially by Riesz- 
Herglotz formula. Fine properties of the measure fi can be transferred in this 
way into restrictions imposed on the generating function F or the unitary 
operator U. 

For applications and variations of the above result (for instance a matrix 
valued analog of it) the reader can consult [AM021 IAkh651 IFF90| IRR97] . 

3.2. Hamburger's moment problem. The passage from the torus to the 
real line reveals some unexpected turns, due to the non-compactness of the 
line. One may argue that the correct analog on the line would be the con- 
tinuous Fourier transform. Indeed, we only recall that Bochner's Theorem 
provides an elegant characterization of the Fourier transforms of positive 
measures. 

Instead, we remain consistent and study polynomial functions and positive 
measures acting on them. Specifically, consider an M-linear functional 

L : R[x] — » R, L\ S 2 nx] > 0. 

By denoting 

c k = L(x k ), k>0, 

the condition ^|s 2 r[o:] i s equivalent to the positive semi-definiteness of the 
Hankel matrix 

since 

< £/*c fe+i /, = Y J L{f k x k x l f l ) = LC£f k x k J2 xl fi) = L{f{xf). 

Next use that every non-negative polynomial on the line is a sum of squares 
of polynomials, to invoke Proposition 13.21 for the proof of the following clas- 
sical fact. 

Theorem 3.4 (Hamburger). Let (ck)^L be a sequence of real numbers. 
There exists a rapidly decaying, positive measure fj, on the real line, such 
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that 



oo 



Cfc = / x k dfi(x), k > 0, 



— oo 



if and only if the matrix (cfc+;)^ =0 is positive semi-definite. 



Now we sketch a second proof of Hamburger Theorem, based on the 
Hilbert space construction we have outlined in the previous section. Namely, 
start with the positive semi-definite matrix (cjfc+j)2°j_o an d construct a Hilbert 
space (Hausdorff) completion H of C[x], satisfying 



Let T> denote as before the image of the algebra of polynomials in H; the 
image is dense. The (single) multiplication operator 

(Mp)(x) = xp(x), p&V, 

is symmetric and maps T> into itself. Moreover, M commutes with the 
complex conjugation symmetry of H: 

~Mp = Mp. 

By a classical result of von-Neumann [vNl] there exists a self-adjoint (pos- 
sibly unbounded) operator A which extends M to a larger domain. Since 
A possesses a spectral measure Ea (exactly as in the bounded case), we 
obtain: 



The measure (Eji(dx)l, 1) is positive and has prescribed moments (c&). ■ 

This second proof offers more insight into the uniqueness part of Ham- 
burger's problem. Every self-adjoint extension A of the symmetric operator 
M produces a solution /i(dx) = (-Eyi(dx)i, 1). The set K of all positive mea- 
sures with prescribed moments (cfc) is convex and compact in the weak-* 
topology. The subset of Nevanlinna extremal elements of K are identified 
with the measures (EA(dx)l, 1) associated to the self-adjoint extensions A 



(x k ,x l ) = c k +i, k,l>0. 



c k = {x k ,l) = {M k l,l) 
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of M. In particular one proves in this way the following useful uniqueness 
criterion. 

Proposition 3.5. Let (c^) be the moment sequence of a positive measure \x 
on the line. Then a positive measure with the same moments coincides with 
H if and only if the subspace 

(il + M)V is dense in H, 

or equivalently, there exists a sequence of polynomials p n £ C[x] satisfying 

POO 

lim / \(i + x)p n (x) — 1| 2 dfi(x) = 0. 



n^oo 



Note that both conditions are intrinsic in terms of the initial data {c k ) . For 
the original function theoretic proof see [ MR23] . For the operator theoretic 
proof see for instance [Akh65j . 

There exists a classical analytic function counterpart of the above objects, 
exactly as in the previous case (see §2.41 §2.51 ) of the unit circle. Namely, 
assuming that 

/oo 
x k dn(x), k > 0, 
-oo 

as before, the analytic function 

J — oo * 

is well defined in the upper half-plane > and has the asymptotic ex- 
pansion at infinity (in the sense of Poincare, uniformly convergent in wedges 
< 5 < arg z < ir — 5) : 

F( z ) ~ — ~ ~~\ ' 3(*)>0. 

z z A 

One step further, we have a purely algebraic recursion which determines 
the continued fraction development 



co _ ci_ 

Z Z 2 a 

Po 

z - a 

Z — OL\ 



a k € M, p k > 0. 



z - a 2 - — 
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It was Stieltjes, and then Hamburger, who originally remarked that (c^) 
is the moment sequence of a positive measure if and only if the elements /3k 
in the continued fraction development of the generating (formal) series are 
non-negative. Moreover, in this case they proved that there exists a unique 
representing measure if and only if the continued fraction converges in the 
upper half-plane. For details and a great collection of classical examples 
see Perron's monograph [Per50]. A well known uniqueness criterion was 
obtained via this formalism by Carleman [C26| . It states that uniqueness 
holds if 



x 1 



1/(2*) 
1 C 2k 



OO. 



The condition is however not necessary for uniqueness. 

The alert reader has seen the great kinship between the continued fraction 
recursion just elucidated and the recursion called the Schur Algorithm in 
^2.51 These are essentially the same thing, but one is in the disk setting 
while the other is in the half plane. 

3.2.1. Moments on the semiaxis [0, oo]. The above picture applies with mi- 
nor modifications to Stieltjes problem, that is the power moment problem 
on the semi-axis [0, oo). 

Example 3.6. We reproduce below an example found by Stieltjes, and refined 
by Hamburger. See for details |Per50| . Let p and 5 be positive constants, 
and denote 

1 

a= 2T^' 1 = p ■ 



Then 



a n = (2 + S)p n+1 T[{2 + 5){n + 1)] = I 

Jo 



oo 

x n e -yx a ^ n > 0, 



is a moment sequence on the positive semi-axis. A residue integral argument 
implies 

f CO 

x n sin (72" tan(7ra)) e" 7 ^ dx = 0, n > 0. 



Hence 

fOO 

a, 



poo 

/ x n (l + t sin(7a; a tan(7ra))) e" 71 " dx, 
Jo 
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for all n > and t G (—1,1). This shows that the moment sequence (a n ) 
does not uniquely determine \i even knowing its support is [0, oo). □ 

Summing up the above ideas, we have bijective correspondences between 
the following sets (C+ stands for the open upper half plane): 

A) . Rapidly decaying positive measures fx on the real line; 

B) . Analytic functions F : C+ — ► C+, satisfying sup 4>1 \tF(it)\ < oo; 

C) . Self-adjoint operators A with a cyclic vector^. 

More precisely: 

F(z) = ((A-z)-^,0= T zGC + . 

J — oo *^ Z 

The moment sequence = x k d/j,(x), k > 0, appears in the asymptotic 
expansion of F, at infinity, but it does not determine F, (A, £) or fi. For fur- 
ther details about Hamburger and Stieltjes moment problems see Akhiezer's 
monograph [Akh65]. 

3.3. Several variables. The moment problem on M 9 , g > 1, is consid- 
erably more difficult and less understood. Although we have the general 
correspondence remarked in Proposition 13.11 the gap between a commuting 
tuple of unbounded symmetric operators and a strongly commuting one (i.e. 
by definition one possessing a joint spectral measure) is quite wide. A va- 
riety of strong commutativity criteria came to rescue; a distinguished one, 
due to Nelson [N59j . is worth mentioning in more detail. 

Assume that L : M.[xi, x g ] — * M is a functional satisfying (the non- 
negative Hankel form condition) £|e 2 r[;c] ^ 0. We complexify L and asso- 
ciate, as usual by now, the Hilbert space H with inner product: 

(p,q) = L{pq), c[x]. 

The symmetric multipliers M x , commute on the common dense domain 
T> = C[x] C H. Exactly as in the one variable case, there exists a positive 
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measure [i on M. 9 representing L if and only if there are (possibly unbounded) 
self-adjoint extensions M x , c4i 1 < k < n, commuting at the level of their 
resolvents: 

[(A k -z)-\(A J -z)- 1 } :={A k -z)-\A 3 -z)- 1 - (Aj - z)' 1 (A k - z)' 1 = 0, 

for Qz > 0, 1 < j, k < n. 

See for details |F83j . Although individually every M x . admits at least one 
self-adjoint extension, it is the joint strong commutativity (in the resolvent 
sense) of the extensions needed to solve the moment problem. 

Nelson's theorem gives a sufficient condition in this sense: if {l + x{ + ...+ 
Xg)T> is dense in H, then the tuple of multipliers {M X1 , ...,M Xg ) admits an 
extension to a strongly commuting tuple of self-adjoint operators. Moreover, 
this insures the uniqueness of the representing measure [i. For complete 
proofs and more details see |Berg87, F83J. 



A tantalizing open question in this area can be phrased as follows: 

Open problem. Let (c a +a) a R£?$g be a positive semi-definite Hankel 
form. Find effective conditions insuring that (c a ) are the moments of a 
positive measure. 

Or equivalently, in predual form, find effective criteria (in terms of the 
coefficients) for a polynomial to be non-negative on M 9 . 

We know from Tarski's principle that the positivity of a polynomial is de- 
cidable. The term "effective" above means to find exact rational expressions 
in the coefficients which guarantee the non-negativity of the polynomial. 

We do not touch in this discussion a variety of other aspects of the 
multivariate moment problem such as uniqueness criteria, orthogonal poly- 
nomials, cubature formulas and the truncated problem. See for instance 
|Berg87j [Birg9T] IUF051 IG V611 IKM70] . 



3.4. Positivstellensatze on compact, semi-algebraic sets. Now we 

look at a very popular classes of Positivstellensatze. The hypotheses are 
more restrictive (by requiring bounded sets) than the general one, but the 
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conclusion gives a simpler certificate of positivity. The techniques of proof 
are those used in the multivariate moment problem but measures with com- 
pact semi-algebraic support allow much more detail. 

To state the theorems in this section requires the notions of preorder, 
PO(F) and of quadratic module which we now give, but the treatment of 
them in Section 15,31 on the general Positivstellensatz gives more properties 
and a different context than done here. Let F = {/i, f p } denote a set of 
real polynomials. The preordering generated by F is 

PO(F) = { ]T s a f Sa £ S 2 i?[x]}. 

o-e{o,i} r 

The quadratic module generated by F is defined to be: 

QM{F)= /S 2 R[x]. 
/eFu{i} 

We start with a fundamental result of Schmiidgen, proved in 1991 ( [S91] ). 
which makes use in an innovative way of Stengle's general Positivstellensatz. 

Theorem 3.7 (Schmiidgen). Let F = {fi, f p } be a set of real polynomials 
in g variables, such that the non-negativity set Dp is compact in M g . Then 

a) . A functional L G M[x]' is representable by a positive measure supported 
on K if and only if 

£|po(F) > 0. 

b) . Every positive polynomial on Vp belongs to the preorder PO{F). 

Due to the compactness of the support, and Stone- Weierstrass Theorem, 
the representing measure is unique. We will discuss later the proof of b) in 
a similar context. 

We call the quadratic module QM{F) archimedean if there exists C > 
such that 

C-x\- ... -x] £ QM{F). 

This implies in particular that the semi-algebraic set T>p is contained in 
the ball centered at zero, of radius \[C. Also, from the convexity theory 
point of view, this means that the convex cone QM{F) C M[x] contains 
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the constant function 1 in its algebraic interior (see [K69] for the precise 
definition). If the set T>p is compact, then one can make the associated 
quadratic module archimedean by adding to the defining set one more term, 
of the form C — x\ — ... — x 2 g . 

The key to Schmiidgen's Theorem and to a few forthcoming results in 
this survey is the following specialization of Proposition 13.11 

Lemma 3.8. |P93] Let F be a finite set of polynomials in M[x] with as- 
sociated quadratic module QM(F) having the archimedean property. There 
exists a bijective correspondence between: 

a) . Commutative g-tuples A of bounded self-adjoint operators with cyclic 
vector £ and joint spectrum contained in Dp; 

b) . Positive measures [i supported on Dp; 

c) . Linear functionals L G M.[x]' satisfying L\q M {f) > 0. 

The correspondence is constructive, given by the relations: 
L{ P ) = (p(A)(,0 = [ pdn, P eR[x}. 

Proof. Only the implication c) a) needs an argument. Assume c) holds 
and construct the Hilbert space H associated to the functional L. Let M = 
(M X1 , M Xg ) denote the tuple of multiplication operators acting on H. 
Due to the archimedean property, 

{(C-xj-...-x 2 g ) P , P ) >0, P eC[x], 

whence every M Xk is a bounded self-adjoint operator. Moreover, the condi- 
tion 

(fjP,p)>0, peC[x], 

assures that fj(M) > 0, that is, by the spectral mapping theorem, the joint 
spectrum of M lies on Dp. Let Em be the joint spectral measure of M. 
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Then 

L(p) = [ p(x)(E M (dx)l,l), 
Jv F 

and the proof is complete. ■ 

For terminology and general facts about spectral theory in a commutative 
Banach algebra see [D03] . 

With this dictionary between positive linear functionals and tuples of com- 
muting operators with prescribed joint spectrum we can improve Schmiidgen's 
result. 

Theorem 3.9 ([P93J). Let F be a finite set of real polynomials in g vari- 
ables, such that the associated quadratic module QM(F) is archimedean. 
Then a polynomial strictly positive onVp belongs to QM(F). 

Proof. Assume by contradiction that p is a positive polynomial on T>p 
which does not belong to QM(F). By a refinement of Minkowski separation 
theorem due to Eidelheit and Kakutani (see |K69j ). there exists a linear 
functional L G M.[x]' such that L(l) > and: 

L{p) <0< L(q), q £ QM(F). 

(Essential here is the fact that the constant function 1 is in the algebraic 
interior of the convex cone QM(F)). Then Lemma 13.81 provides a positive 
measure n supported on Vp, with the property: 

L{p) = pdp< 0. 
Jv F 

The measure is non-trivial because 

L(l) = viV F ) > 0, 



and on the other hand p > on Dp, a contradiction. ■ 

An algebraic proof of the latter theorem is due to Jacobi and Prestel, see 
[PDOlj . 
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4. Complex variables 

The operator theoretic counterpart to positive functionals described in 
the previous section becomes more transparent in the case of complex vari- 
ables. We present below, closely following |P06j . a series of generalizations 
of Lemma 13.81 and Theorem 13.91 It is at this point when von Neumann's 
inequality becomes relevant. 

Throughout this section z = (zi,...,z g ) denote the complex coordinates 
in C 9 . Then real coordinates of the underlying space M. 2d are denoted by 
x = (xi, X2n), where Zj. = x^ + ^n+fe- We will work as before in the 
polynomial algebra C[x] = C[z,~z], and consider there the convex hulls of 
non-negative polynomials: 

S 2 = co{p 2 ; p G R[x}}, 

and 

Z 2 h = co{\q\ 2 ; qeC[z]}. 

It is easy to see that the cone of hermitian positive squares E| is a proper 
subset of £ 2 . Indeed, remark that 

kl 2 >o, qeC[z], 



dz k dz k 

while the same Laplace operator has negative values on (y^ — x 2 .) 2 . 

Let F = {pi, ...,p m } be a finite subset of M[x] so that the basic semi- 
algebraic set 

V F = {x G R 2d ; Pl (x) > 0, ..., Pm (x) > 0} 

is compact. Let p be a real polynomial which is positive on T>p. We will 
indicate conditions which insure the decompositions: 

(4.1) P GS 2 + m S 2 + ...+p m S 2 , 
or 

(4.2) P G S 2 + Pl S 2 + ...+p m S 2 , 
or 

(4.3) p G QM(F) = S 2 + Pl T? + ... + p m Y?. 
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The last one is covered by Theorem 13.91 The other two require stronger 
assumptions on p, as we shall see below. 

We start by recalling an early, unrelated observation about strictly posi- 
tive hermitian polynomials |Q68|. 

Theorem 4.1 (Quillen). If a bi-homogeneous polynomial p £ C[z,z] satis- 
fies p(z,z) > for z / 0, then there exists N £ N such that: 

\ z r P {zrz)eTi. 

This result was rediscovered, and obtained by a different method, by 
Catlin and d'Angelo [CD96]. Their approach proved to be geometric in 
its nature and very flexible, leading to a variety of refinements of Quillen's 
theorem, see for instance [CD99, d'A05, d'AV03]. This line of research, 
not developed in the present survey, culminates in completely removing the 
strict positivity assumption. Specifically, the following characterization of 
bi-homogeneous quotients of polynomials of was recently discovered. 

Theorem 4.2 (Varolin, [Var06]). Let the bi-homogeneous polynomial p £ 
C[,z,z] satisfy p(z,z) > for z £ C 9 . Write p(z,z) = ^2 a ^p a ^z a z^ and, 
using the positive and negative spectral subspaces of the Hermitian matrix 
(Pa,p), decompose p = p + — p-, with p± £ S^. 

Then there are non-zero polynomials s,t G with the property 

sp = t 

if and only if there exists a positive constant C , such that 

P++P- < C{p+-p-) 

on C 9 . 

Next we return to the compact semi-algebraic set T>f C C 9 and the three 
levels of positivity certificates described by the preceding convex cones. 

We recall that a commutative (7-tuple of linear bounded operators T acting 
on a Hilbert space H is called subnormal if there exists a larger Hilbert 
space H C K and a commutative 5-tuple of normal operators N acting on 
K, so that every Nj leaves H invariant and Nj\x = Tj, 1 < j < g. A 
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commutative g tuple N of normal operators Nj = Xj + iX g+ j consists of 2g 
mutually commuting self-adjoint operators X\, A~2 9 . Hence N possesses 
a joint spectral measure En, supported on the joint spectrum a(N) C C 9 . 

Assume from now on that the cone E?+piE? + ...+p m E? is archimedean, 
that is, after a normalization: 

1 - |zi| 2 - ... - \z g \ 2 G El+piT,l + ... +p m T,l. 

Let L £ C[x]' be a linear functional satisfying 

Instead of constructing the completion of the whole ring of real polynomials, 
we consider the same L-inner product, defined only on complex polynomials 
C[z], in the variables z. Let H be the associated Hilbert space. The multi- 
plication operators M z . act and commute on H. They are bounded due to 
the above mentioned archimedean property: 

||M,^(z)|| 2 = L(|^(z)| 2 )<L(|g(z)| 2 ) = || (? || 2 . 

The only difference to the framework of the multivariate Hamburger moment 
problem is that now M Zj are not necessarily self-adjoint operators and the 
commutators [M Zj , M*J may not vanish. The constant function vector 1 
remains cyclic, in the sense that the span of the vectors M^...Mzfl is the 
whole Hilbert space H. 

Let M = (M Zl , ...,M Zg ) be the commutative n-tuple of operators we have 
just constructed. For a polynomial p(z,z) we define after Colojoara and 
Foias, cf. ([AM02]), the hereditary functional calculus p(M, M*) by putting 
all adjoints M* k in the monomials of p to the left of the powers of M z .'s. 
For example, 

(|*53z5)(M) = M* Zl M* Z3 M zl Ml. 
We have thus established the first part of the following dictionary. 

Proposition 4.3. Let F = {p\, ...,p m } be a finite set of real polynomials in 
2g variables, such that 

1 - \ Zl \ 2 - ... - \Zg\ 2 £ S 2 + Pl S 2 + ... +P m S 2 . 
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a) . There exists a bijective correspondence between Junctionals L £ C[x]' 
which are non-negative on £ 2 +piS? + ... + p m £? and commutative g-tuples 
of linear Hilbert space operators T, with a cyclic vector £, subject to the 
conditions 

Pj(T,T*) >0, l<j<m. 

b) . If, in addition L is non-negative on the larger cone T? -\- piT^ + ... + 
p m E?, then, and only then, T is also subnormal. 

In both cases, the correspondence is given by 

L(p(z,z)) = (p(T,T*)t,0- 

Proof. For the proof of part b) remark that the Hilbert space completion 
H of the ring of complex polynomials with respect to the inner product 
iPiO) = L(pq), p,q G C[z] carries the bounded, commutative g-tuple M of 
multiplication operators with the variables z\,...,z g . The positivity of the 
functional L on X 2 is equivalent to the multivariate analog of the Halmos- 
Bram subnormality condition applied to M. See for details [Dem02t [P93] . 

Conversely, if T is a cyclic subnormal tuple of operators with normal 
extension N, and p(z,z) is a polynomial, then 

(\p\ 2 (T,T*)£,0 = (\p\ 2 (N,N*)C,0 = \\p(N,N*)t\\ 2 > 0. 

This uses the very definition of the hereditary calculus, for example, 

(TfTfTfTit £> = (TfT^, 2?2&> = (TfN^, TfN^) 

{N*N*£, N?N*0 = \\N?N^\\. 

m 

The following translation of the proposition shows that the class of all 
commutative tuples of operators serves as a better "spectrum" for the poly- 
nomial algebra in the variables (z,z). 

Corollary 4.4. Let F be as in the Proposition and let p{z,~z) be a poly- 
nomial. If p(T,T*) > for every commutative g -tuple of linear Hilbert 
space operators T, satisfying pj(T,T*) > 0, 1 < j < m, then p belongs to 

^t+Pl^t + .-.+Pm^l 
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The cyclic vector condition is not relevant for this statement. 
Proof. The proof follows from now a known pattern. Assume by contra- 
diction that p ^ + pi£ 2 + ... +p m T,j i . By Minkowski-Eidelheit-Kakutani 
separation theorem, there exists a linear functional L G C[x]' satisfying the 
conditions of Proposition 14.31 and L(p) < < L{\). Then the commutative 
g-tuple M associated to the inner-product space defined by L satisfies 

<p(M,M*)l,l)=L(p)<0, 

a contradiction. ■ 

Even the simple case of the unit ball or unit polydisk in C 9 is interest- 
ing from this perspective. Assume first that n = 1. According to von- 
Neumann's inequality, 

\\P(T)\\<1, 

whenever T is a contraction and sup^jj \p(z)\ < 1. Thus, in view of the above 
proposition, for every polynomial p(z) and constant M > sup 2g p |p(#)|, we 
have 

M 2 -|p(z)| 2 GSl + (l-|z| 2 )^. 

Needless to say that this statement is equivalent to von-Neumann's inequal- 
ity. 

In complex dimension two, a celebrated theorem of Ando (see [AM02, 
ICW99j ) asserts that, for every pair of commuting contractions (T\,T2) and 
every polynomial p(zi, z<i) one has a von-Neumann type inequality: 

||p(Ti,T 2 )|| < HplU^. 

And a not less celebrated example of Varopoulos (see again [AM02] ) shows 
that it is no more the case in dimension n = 3 and higher. Specifically, 
according to our corollary, for every polynomial p{z) and e > 0, we have 

(IIpIL^ + e) 2 - W)\ 2 e^ + (i- N 2 )£ 2 + (1 - |z 2 | 2 )S 2 , 

but the statement is not true (for the unit polydisk) in higher dimensions. 
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Open problem. It would be interesting to find an algebraic explanation 
of Ando's Theorem via the above equivalent sums of squares decomposition. 

On the other hand, by exploiting part b) of Proposition 14.31 one can prove 
a sharper weighted sums of squares decomposition on complex analytic poly- 
hedra. 

Theorem 4.5. Let S = {z 6 C 9 ; \pj(z)\ < 1, 1 < j < m} be a compact semi- 
algebraic set, where pj are complex polynomials. Assume that the convex 
cone 

C = X 2 + (l-\p 3 \ 2 )Z 2 h + ... + (l-\p m \ 2 )Xl 

is archimedean. Then every real polynomial p which is strictly positive on S 
belongs to C . 

For a proof see [P931 IP06| . The article [BGM05] contains a similar ap- 
proach to bounded analytic multipliers on the Hardy space of the bi-disk. 
The recent note [P06J contains a few other sums of squares translations of 
some recently proved inequalities in operator theory. 

Finally, we reproduce from [HMP06] the following general Nichtnega- 
tivstellensatz over the complex afhne space. Note that when evaluating 
on tuples of commutative matrices, we do not have to impose the strict 
positivity of the polynomial to be decomposed. 

Theorem 4.6. Letp(z,~z) be a real valued polynomial, where z£C s . Then 
there are polynomials qi G C[z], 1 < % < k, with the property 

k 

v{z,z) = ^2 h( z )\ 2 i 

i=l 

if and only if, for all tuples of commuting matrices X = {X\, X g ) G 
Mrf(C), d > 1, we have 

p(X,X*)>0. 



The proof follows the general scheme outlined in this section and we omit 
it. See [HMP06] for full details. 
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5. Real algebra and mathematical logic 

Keeping in mind the main theme of our essay (sums of squares decom- 
positions), we briefly recall below, without aiming at completeness, some 
classical facts of real algebra and mathematical logic. We follow an ap- 
proximate chronological order. For a more comprehensive, recent and very 
authoritative survey of real algebra and real algebraic geometry aspects of 
sums of squares we refer to Scheiderer [S03]. 

5.1. Minkowski and Hilbert. In the same time to, and even before, the 
analysis aspects of sums of squares decompositions we have discussed have 
been discovered, similar questions have appeared in number theory and alge- 
bra. Lagrange's famous theorem (that every positive integer can be written 
as a sum of squares of four integers) was the origin of many beautiful studies, 
see Chapter XX of Hardy and Wright's monograph [HW93J. 

According to Hilbert [Hilbl910] . after the teenager Minkowski won the 
1882 "Grand Prix" of the French Academy of Sciences on a theme related 
to Lagrange's four squares theorem, he has started working in Konisberg on 
his thesis devoted to quadratic forms of a higher number of variables. It was 
in Minkowski's inaugural dissertation, with Hilbert as opponent, that he 
remarked that "it is not probable that every positive form can be represented 
as a sum of squares" [M86J. 

The opponent (Hilbert) produced the first (non-explicit) example, see 
[Hilbl888j. His idea is the following. Consider nine points a%, ag in R 2 , as 
the base of a pencil of cubics (that is, a family of curves obtained via a linear 
combination of their third degree defining equations), so that every cubic 
polynomial vanishing at the first eight points a±,...,as will automatically 
vanish at ag. By a rather involved geometric argument, one can prove the 
existence of a polynomial p(x,y), of degree six, which is non-negative on 
M 2 , vanishes at a\, ...,a% and satisfies p(ag) > 0. Then clearly p cannot be 
written as a sum of squares of polynomials: 



p = q\ + ... + q%, 
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because every qi would have degree at most three and therefore would be 
null at ag, too. Hilbert's argument is reproduced at the end of Chapter II 
of Gelfand and Vilenkin's monograph |GV61j . 

The first explicit example based on Hilbert idea was constructed by Robin- 
son in 1969, see |R92j for details. Robinson's homogenized polynomial is : 

P{x,y,z) = x 6 + y 6 + z 6 - 

r 4 2 i 24, 42, 24, 42, 2 4\ , o 2 2 2 

— (xy + x y + x z + x z +y z +yz) + 6xyz. 

About the same time (some six dozen years after Hilbert's article) Motzkin 
has produced a very simple polynomial (again shown in homogenized form): 

Q(x, y, z)=z 6 + x 2 y 2 (x 2 + y 2 - z 2 ). 

The reader will find easily why Q is non-negative, but not a sum of squares. 
More examples of non-negative polynomials which are not sums of squares 
were discovered by Choi, Lam and Reznick, and separately Schmiidgen. 
We refer to Reznick's monograph [R92] for more details and for an elegant 
geometric duality method (based on the so called Fisher inner product) 
adapted to the analysis of the convex cones of such polynomials. 

One of Hilbert's celebrated 1900 problems in mathematics was about the 
structure of positive polynomials and the logical implications of the exis- 
tence of a constructive way of testing positivity. Here are his words: 

Problem 1 7. Expression of definite forms by squares: 

"...the question arises whether every definite form may not be expressed 
as a quotient of sums of squares of forms... 

... it is desirable, for certain questions as to the possibility of certain 
geometrical constructions, to know whether the coefficients of the forms to 
be used in the expression may always be taken from the realm of rationality 
given by the coefficients of the form represented." |Hilbl9 10j. 
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Hilbert's intuition proved to be correct on both conjectures raised by his 
question. His query was the origin of a series of remarkable results in algebra 
and logic, see |BCR98l IPDUTI IS05j . 

5.2. Real fields. Hilbert's 17-th problem was solved in the affirmative by 
E. Artin |Art26] in 1927, as an application of the theory of real fields he has 
developed with Schreier }AS26j . For history and self-contained introductions 
to real algebra, and complete proofs of Artin's Theorem, we refer to either 
one of the following monographs [BCR98J IJ89L IPDOlj . We merely sketch 
below the main ideas of Artin-Schreier theory (as exposed in Jacobson's 
algebra book [J89] ). to serve as a comparison basis for the computations we 
will develop in later sections in the framework of star algebras. 

An ordered field is a characteristic zero field F with an ordering, that is 
a prescribed subset P of positive elements, satisfying: 

P + PCP, P-PcP, F = PU{0}U{-P}. 

Since a 2 G P for all a ^ 0, if J2i a 1 = 0) then every a, = 0. Or equivalently, 
— 1 cannot be written as a sum of squares in F. By a theorem of Artin and 
Schreier, every field with the latter property can be ordered. An ordered 
field R is real closed if every positive element has a square root. In this case, 
exactly as in the case of real numbers, the extension R(\/—l) is algebraically 
closed. 

A central result in Artin-Schreier theory is the existence and uniqueness 
of the real closure R of an ordered field F: that is the extension F C R 
is algebraic and x £ F is positive in R if and only if is positive in F. In- 
terestingly enough, the proof of this fact uses Sturm's algorithm for the 
determination of the number of roots of a polynomial with real coefficients 
(or more generally with coefficients in a real closed field). 

Sturm's algorithm. Let R be a real closed field and let 



p(x) = clq + a\x + ... + x 6 R[x] 
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be a polynomial. Let 

C = 1 + |ad_i| + ... + | ai| + \a \. 

Define the sequence of polynomials: 

Po = Pi Pi = p' (the derivative), 

Pj+i = Pjlj ~ Pj-h degp j+ i < degpj. 
Then for a large n, p n = 0. Sturm's Theorem asserts that the number of 
roots of p in R is N(—C) — N(C), where N(a) is the number of sign changes 
in the sequence po(a),pi(a), ...,p n (a). 

As an application of the existence of the real closure of an ordered field, 
one can prove that an element x £ F is a sum of squares if and only if it 
is positive in every order on F. Or, equivalently, if y £ F is not a sum of 
squares, then there exists an order on F with respect to which y < 0. 

Theorem 5.1 (Artin). Let F be a subfield o/M which has a unique ordering 
and let f be a rational function with coefficients in F. If f(a\, a g ) > 
for all (ai, ...,a 9 ) G F 9 for which f is defined, then f is a sum of squares of 
rational functions with coefficients in F. 

The idea of the proof is to admit by contradiction that / is not a sum 
of squares, hence it does not belong to an ordering of the field of rational 
functions K = F(xi, ■■■,x g ) in g variables. By completing K to a real closed 
field R, one finds "ideal" points bi,...,b g £ R, so that f(bi,...,b g ) < 0. By 
Sturm's counting theorem one shows then that there are points a±, ...,a g E F 
with the property f(ai,...,a g ) < 0. For details see for instance §11.4 in [J89J. 

Examples of fields with a unique ordering are Q and R. Artin's Theorem 
prompts a series of natural questions, as for instance : how many squares 
are necessary, is there a universal denominator in the decomposition of / 
as a sum of squares, are there degree bounds? All these problems were 
thoroughly studied during the last decades, [BCR981 |PD01| . 

It was Tarski who in the late 1920-ies put Sturm's Algorithm into a very 
general and surprising statement. His work had however an unusually long 
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gestation and has remained unknown to the working mathematician until 
mid XX-th Century. His articles are available now from a variety of parallel 
sources, see for instance [T86 and the historical notes in [PDOlj . His main 
thesis is contained in the following principle, cited from his original 1948 
RAND publication [T86] : 

Tarski's elimination theory for real closed fields. "To any formula 
4>(xi, Xg) in the vocabulary {0,1, +,-,<} and with variables in a real 
closed field, one can effectively associate two objects: 

(i) a quantifier free formula (f>(xi, x g ) in the same vocabulary, and 

(ii) a proof of the equivalence 4> = <t> that uses only the axioms of real 
closed fields." 

He aimed this theorem at the completeness of the logical system of ele- 
mentary algebra and geometry, very much in the line of Hilbert's programme 
in the foundations of mathematics. As a consequence one obtains the trans- 
fer principle alluded above, in the proof of Artin's Theorem: let R\ C R2 be 
real closed fields. A system of polynomial inequalities and equalities with 
coefficients in R\ has a solution in R2 if and only if it has a solution in R\ . 

Let R be a real closed field. We recall that a semi-algebraic set in R 9 is 
a finite union of finite intersections of sets of the form 

{x e R 9 ; p(x) = 0}, {x e R 9 ; p{x) > 0}, or {x G R 9 ; p{x) > 0}. 

A self-contained account of Tarski's theorem can be found in |S54| . See 
also [BCR98, J89, PDOlj. In practice, the most useful form of Tarski's result 
is the following theorem. 

Theorem 5.2 (Tarski-Seidenberg). If R is a real closed field and S is a 
semi-algebraic set in R 9 x R m , then the projection of S onto R 9 is also 
semi-algebraic. 



For a self contained proof of the above theorem see the Appendix in [Horm83j . 
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Applications of Tarski's principle came late, but were spectacular. We 
only mention for illustration one of them: 

Hormander's inequality. (1955) : For every polynomial 
f{x\, ...,x g ) £ M.[xi, —,x g ] there are positive constants c,r such that 

\f(x)\ > c dist(x, V(f)) r , x E R 9 , \x\ < 1. 

Above V(f) stands for the real zero set of /. The inequality was generalized 
to real analytic functions by Lojasiewicz in 1964, and served as the origin 
of fundamental discoveries in modern analytic geometry and the theory of 
partial differential operators, see for instance [BCR98 , IHorm83] . 

5.3. The general Positivstellensatz. A great jewel of real algebraic ge- 
ometry, which is now causing excitement in applications, is the Positivstellen- 
satze and the real Nullstellensatz (which it contains). This section states 
these theorems and can be read independently of earlier parts of this paper. 

The Positivstellen-satze lives in a polynomial ring with coefficients in a 
real closed field (as opposed to the complex numbers) and were discovered 
only in the 1960-ies (see |Dub6 9 , K64J) before being rediscovered and refined 
by Stengle in 1974 ( |S74j ). The statement of the Nullstellensatz departs from 
Hilbert's Nullstellensatz over an algebraically closed field, by imposing an 
additional sum of squares term in the characterization of a radical ideal, as 
we shall see below. It is interesting to remark that Stengle's article makes 
specific references, as origins or motivations of his investigation, to works in 
mathematical logic and mathematical programming. 

In order to state Stengle's Positivstellen-satze we need first a few defini- 
tions and conventions. Let R be a real closed field (many readers will be 
happy to think of R as the real numbers) and denote x = (x%, ...,x g ) £ R 9 
and also regard x as a g-tuple of commuting indeterminates. Let T, 2 A denote 
the set of all sums of squares in the algebra A . 

Let S C R[x] be a subset, and write 

V s = {x e R 9 ; p{x) > 0, Vp G S}, 
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for the positivity set of of the functions S. If S = {pi, ...,p r } is finite then 
T>s is a basic closed semi-algebraic set . The preordering generated by S is 

PO(S) = { s*PV-Pr r \ s a eZ 2 R[x]}. 

The quadratic module generated by S is 

QM(S) = Y?R[x\ + Pl Y?R[x] + ...+ Pr Y?R[x\. 

Note that a preordering satisfies conditions similar to an ordering in a 
field: 

PO{S) + PO(S) C PO(S), PO(S) ■ PO(S) C PO(S), T, 2 R[x] c PO(S), 

while the quadratic module fails to be closed under multiplication, but still 
satisfies: 

QM{S)+QM{S) C QM(S), T?R[x] C QM(S), Z 2 R[x]-QM(S) C QM(S), 
and clearly, 

QM(S) C PO(S). 



Theorem 5.3 (Stengle). Let R be a real closed field and letpi,...,p r £ 
R[x±, x g \. Let S = V(p\, ...,p r ) and let T = PO(p±, ...,p r ) be the preorder 
generated by pi. Let f £ R[x\, ...,x g ]. Then 

(a) . / > on S if and only if there are s,t € T satisfying sf = 1 + 1; 

(b) - f > on S if and only if there are s,t G T and an integer N > 0, 
with the property sf = f 2N + t; 

(c) . f = on S if and only if there exists an integer N > with the 
property —f 2N G T. 
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We derive a few particular consequences. For instance, the following real 
Nullstellensatz is contained in the preceding result: assume that p,q G R[x] 
and 

(p(x)=0) =>(q(x) = 0). 

Then point (c) applies to / = q and p\ = —p 2 . We infer: there exists N > 
such that 

2N 2 „ r 2n[ 1 

—q = si - p s 2 , si, s 2 G 2, R[x\, 

therefore: 

q 2N + s G (p), s G S 2 ii[x], 

where (p) denotes the ideal generated by p. Obviously, if the latter condition 
holds, then q vanishes on the zero set of p. Thus, we have proved: 

{(p(x) = 0) {q(x) = 0)] & [3(N > 0, s G Z 2 R{x}) : g 2iV + s G (p)]. 

Variants of this are pleasurable, and we suggest as an exercise the reader 
repeat the above but take f = q and p\ = p, P2 = ~P- 

As another example, assume that p(x) > for all x G R 9 . Then the 
theorem applies to / = p and p\ = 1 and we obtain: there exists an integer 
N > and elements s\,S2 G T, 2 R[x], such that 

s\p = p 2N + s 2 . 

In particular, 

s\p G T, 2 R[x], 

which is exactly the conclusion of Artin's Theorem. 

The concepts of (pre)ordering and quadratic module can be defined for 
an arbitrary commutative ring with unit; these, together with the important 
construct of the real spectrum provide the natural framework for developing 
modern real algebra and real algebraic geometry. For the general versions 
of the Satze outlined in this section the reader can consult as a guide [S03J, 
and for complete details and ramifications, the monographs [BCR98, PD01J. 
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6. Applications of semi-algebraic geometry 

The prospect of applying semi-algebraic geometry to a variety of areas 
is the cause of excitement in many communities; and we list a few of them 
here. 

6.1. Global optimization of polynomials. An exciting turn in the un- 
folding of real algebraic geometry are applications to optimization. To be 
consistent with the non-commutative setting of the subsequent sections we 
denote below by x G R 9 a generic point in Euclidean space, and in the same 
time the g-tuple of indeterminates in the polynomial algebra. 

6.1.1. Minimizing a Polynomial on R 9 . A classical question is: given a poly- 
nomial q € M.[x], find 

min q(x) 

xeRf 

and the minimizer x opt . The goal is to obtain a numerical solution to this 
problem and it is daunting even in a modest dimension such as g = 15. 
Finding a local optimum is numerically "easy" using the many available 
variations of gradient descent and Newton's method. However, polynomials 
are notorious for having many many local minima. 

A naive approach is to grid R 9 , lets say with 64 grid points per dimension 
(a fairly course grid), and compare values of q on this grid. This requires 64 15 
~ 10 9 10 7 function evaluations or something like 10,000 hours to compute. 
Such prohibitive requirements occur in many high dimensional spaces and 
go under the heading of the "curse of dimensionality" . 

The success of sums of squares and Positivstellensatze methods rides on 
the heels of semi-definite programming, a subject which effectively goes back 
a decade and a half ago, and which effectively allows numerical computation 
of a sum of squares decomposition of a given polynomial q. The cost of the 
computation is determined by the number of terms of the polynomial q and 
is less effected by the number g of variables and the degree of q. To be more 
specific, this approach to optimization consists of starting with a number 
q** and numerically solve 
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for s £ X 2 . If this is possible, lower q** according to some algorithm and 
try again. If not, raise q** and try again. Hopefully, one obtains q*° at the 
transition (between being possible to write q — q** as a sums of squares and 
not) and obtains 

q — q*° G £ 2 

and conclude that this is an optimum. This method was proposed first 
by Shor [S87] and subsequently refined by Lasserre [LOlj and by Parrilo 
[ParThesisj . 

Parrilo and Sturmfels l\S0.'5 reported experiments with a special class of 
10,000 polynomials for which the true global minimum could be computed 
explicitly. They found in all cases that q*° determined by sums of squares 
optimization equals the true minimum. 

Theoretical evidence supporting this direction is the following observa- 
tion, see [BCR98] §9. 

Theorem 6.1. Given a polynomial q £ M[x], the following are equivalent: 

(1) q> on the cube [-1, l] s . 

(2) For all e > 0, there is s £ S 2 such that 

\\l - s lli 1 ([-i,i]f ) < £ - 

A refinement of this result was recently obtained by Lasserre and Netzer 
[LN06] . Namely, the two authors prove that an additive, small perturba- 
tions with a fixed polynomial, produces a sum of squares which is close to 
the original polynomial in the L 1 norm of the coefficients. We reproduce, 
without proofs, their main result. 

Theorem 6.2. [LN06] Let p EM[x%, ...,x g ] be a polynomial of degree d, and 
let 

® r = l+X 2 { + ... +X 2r , 

where r > d/2 is fixed. Define 

e* r = min{L(p); L G R 2 r[xi, x g ]', L(0 r ) < 1, L|| > 0}. 
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Then e* < and the minimum is attained. The polynomial 

Pe,r =p + €@ r 

is a sum of squares if and only if e > — e*. 

Moreover, if the polynomial p is non-negative on the unit cube [—1,1] 9 , 
then lim^oo e* = 0. 

Variations of the above theorem, with supports on semi-algebraic sets, 
relevant examples and an analysis of the degree bounds are contained in the 
same article |LN06j . 

For quite a few years by now, Lasserre has emphasized the tantamount im- 
portance of such perturbation results for optimization using sums of squares 
(henceforth abbreviated SOS) methods, see |L01| . in that it suggests that 
determining if a given p is nonnegative on a bounded region by computing 
a sums of squares has a good probability of being effective. 

We shall not prove the stated perturbation results, but remark that a free 
algebra version of them holds, [KS05J. 

In the opposite pessimistic direction there are the precise computations 
of Choi-Lam- Reznick (see |R92j ) and a recent result due to Bleckermann 
[Blec04j . 

As a backup to the above optimization scheme, if a q — q*° E £ 2 fails to 
be a sum of squares, then one can pick a positive integer m and attempt to 
solve 

(l + \x\ 2 ) m (q-q*°) GS 2 . 

Reznick's Theorem |R95j tells us that for some m this solves the optimiza- 
tion problem exactly. Engineers call using the term with some non zero 
m "relaxing the problem", but these days they call most modifications of 
almost anything a "relaxation" . 

6.1.2. Constrained optimization. Now we give Jean Lasserre's interpretation 
of Theorem 13.91 Let V denote a collection of polynomials. The standard 
constrained optimization problem for polynomials is: 
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minimize q(x) subject to x £ V-p := {x £ JR s ;p(:z;) >0, T 7 }. 

Denote the minimum value of q by g op< . We describe the idea when V 
contains but one polynomial p. Assume Vp(x) does not vanish for x £ &D p . 

The standard first order necessary conditions for x op * £ <9D-p to be a local 
solution to this problem is 

Vq(x opt ) = XVp(x opt ) 

with A > 0. We emphasize, this is a local condition and A is called the 
Lagrange multiplier. 

Now we turn to analyzing the global optimum. Suppose that q can be 
expressed in the form: 

q-q = si + s 2 p, si,2 E 5J , 

which implies q(x) > q** for all x S T> p . So g** is a lower bound. This 
is a stronger form of the Positivstellensatz than is always true. Then this 
optimistic statement can be interpreted as a global optimality condition 
when q** = q opt . Also it implies the classical Lagrange multiplier linearized 
condition, as we now see. At the global minimum x opt we have 

= q(x opt ) - q opt = Si{x opt ) + s 2 (x opt )p{x opt ) 

which implies = si(x opt ) and, since si is a sum of squares, we get \7si(x opt ) = 
0. Also s 2 {x opt = 0, Vs 2 {x opt ) = whenever p(x opt ) / 0. Calculate 

Vq = Vsi +pVs 2 + s 2 Vp. 

If p{x opt ) = 0, we get 

Vq{x opt ) = s 2 {x opt )Vp{x opt ) 

and if p(x opt ) ^ we get Vq(x opt ) = 0, the classic condition for an optimum 
in the interior. Set A = s 2 (x° pt ) to get Wp(x opt ) = Vq(x opt ) the classic 
Lagrange multiplier condition as a (weak) consequence of the Positivstellen- 
satz. 

The reference for this and more general (finitely many pj in terms of the 
classical Kuhn- Tucker optimality conditions) is [L01] Proposition 5.1. 
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Also regarding constrained optimization we mention that, at the technical 
level, the method of moments has re-entered into polynomial optimization. 
Quite specifically, Lasserre and followers are relaxing the original problem 

min q(x) 

x£T> 

as 

min / qdfj,, 
v Jv 

where the minimum is taken over all probability measures supported on D. 
They prove that it is a great advantage to work in the space of moments (as 
free coordinates), see [HL051 lLOTI IL04] . 

6.2. Primal-dual optimality conditions. In this section we explain in 
more detail Lasserre's point of view [LOO} IL01| of linearizing polynomial 
optimization via sums of squares decompositions and via moment data seen 
as independent variables. 

Specifically, we start with a polynomial / G M[x] and seek values of the 
scalar A for which / — A G X 2 . To this aim we consider a variable linear 
functional L G M[x]' and denote the corresponding moments 

y a = L(x a ), \a\ < 2d. 

The integer d is fixed throughout the whole section and will not explicitly 
appear in all coming formulas. We denote y = (y a )\ a \<2d an d consider the 
associated Hankel matrix 

My = {Va+p)\a\,\l3\<d- 

In all these considerations it is important to fix an ordering (such as the 
graded lexicographic one) on the multi-indices a. Let 

V(X) = (Ml,"" ,Xg,xl,---) 

be the "tautological" vector consisting of all monomials of degree less than 
or equal to d. Let ao = (0, 0, • • • ,0), so x a ° = 1. 
The matrix valued polynomial 

Y,B a x a = V(x)-V{x) T , 

a 
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produces a sequence of matrix coefficients B a which carry a Hankel type 
structure and whose entries are either or 1. Write 

a 

The next lemma equates "minimization" of a polynomial via a sum of 
squares to a matrix problem. 

Lemma 6.3. The degree d polynomial f — A is a sum of squares with r 
squares of polynomials each having degree < n if and only if there exists a 
positive semi- definite matrix f2 6 JR nxn of rank r such that 

(6.1) ti(B a Q) = f a — A5 ao for all \a\ < d. 

Proof. Write the symmetric positive semi-definite nxn matrix f2 = Y^j QjlJ 
were r is the rank of f2 and qj £ M n . Then 

(6.2) tr(J2 B a x a Q) = £ f a x a - A 

a a 

which gives 

r 

tr( V{x)V{x) T Y,wD = f( x ) ~ A 
3 

and 

Y,qjV(x)V(x) T q,=f(x)-\ 

j 

so we obtain 

J2Qj(x)Q j (x) = f(x)-X 
j 

where Qj{x) := V(x) T qj. The argument reverses, so / a sum of squares 
implies = Y^ljlJ wmcn makes positive semi-definite. ■ 

Clearly, there are many matrices f2 > satisfying (|6.1|) . A canonical 
choice, appearing in the next Lemma, was proposed by Nesterov. 

Lemma 6.4. Suppose there is a positive definite solution to &6.1\) , then one 
of them Cl has inverse, f2 _1 which is a Hankel matrix. 
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Proof. We show that Cl is the "maximum entropy solution" to (|6.ip . 
namely, Cl is the (unique) solution to 



max In det Q subject to (|6.ip . 

It is standard that if a positive definite ft, exists maximizing entropy, then 
it "keeps its eigenvalues positive", so Cl is positive definite. We use the 
standard formula 

d In det(0 + tA) . r » ,„ 
1 *M = -tr[0 A] 

which is for all A satisfying tr[S Q A] = for all a. Now tr[f) _1 A] = says 
that _1 is in the orthogonal complement of the orthogonal complement of 
span {B a }, thus Cl -1 G span {B a }, in other words, it is a Hankel matrix. 
■ 

Our next step is the minimization problem: 

L f := minL(/) subject to L(S 2 ) > and L(l) = 1. 

Clearly, 

Af < min / fd/i = min/(x) 
M J * 

as u ranges over all probability measure. If the minimum is attained, Dirac's 
measure at the optimal x yields A/ = min x f(x). 

Since L = L y corresponding to the moment sequence y satisfies L y (f) = 
^2 a fall a our basic "primal" problem is: 

(PRIMAL) mmS^ f a y a subject to M y > y ao = 1. 

a 

The "dual" problem is 

(DUAL) max A subject to > and ti(B a Q) = f a — \S ao 

for all a which we saw in Lemma 16.31 as solving the sum of squares problem 
is the same as 

As := max A subject to / — A is in E. 
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To derive that these problems are indeed dual to each other define a "La- 
grangian" by 

C(y, fi, A) := ^2 faVa - tr(^ y a B a Q.) - (y ao - 1)A 

a a 

on the set M y > 0, f2 > 0, A G M. For convenience write 

(6.3) C(y, tt, A) := ^ [(/« - tr(^ B a n) - 5 ao X]y a - y ao X. 

a a 

Then 

min max C(y, Q, A) = min V] if 2/a = 1 (is + oo if y ao / 1). 

»— — »— a 

which is the primal problem. Next 

max min £(«, fL A) = maxA if f a — trf-BafJ)— A£ Qn =0 (is —oo otherwise). 

n>o,A M y >0 

which is the dual problem. We summarize with 
Lemma 6.5. Forfl,M y inR nxn , define sets 

N := {fl > 0, A G M : /„ - tr(J3 a fi) - XS ao = aZZ |a| < d} 

M:={y: M y > 0, y ao = 1}. 

T/ien is not empty for some dimension n if and only if f — A is a sum of 
squares and we have 

As := max A = max min C(y, f2, A) 

n,\£Af n,\£Afy€M 

< min max C(y, f2, A) = min N =: Af 

A saddle point f2, A, y is defined as one which satisfies 

As = C(y,Cl,X) = A/. 

The lemma verifies our claim that the problems PRIMAL and DUAL are 
dual with respect to each other. Also if they have a saddle point they have 
the same optimal value A := As = A/. Existence of a saddle point, because of 
the bilinearity of C, is in the perview of the von Neumann Minmax Theorem, 
but we do not discuss this here. We refer the reader to [LOO} IL014 IHL06] for 
further details. Although their approach is a bit different. 
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Now we make a few remarks. Firstly, the saddle point condition 
A = C(y, Q,X) = faVa ~ tr(^ y a B a Cl) - (y ao - 1)A 

a a 

reduces to = trQ^ a y a B a Cl) = tr(Myti). Since M$,£l are both posi- 
tive semi-definite, this forces the "complementarity" of an optimal moment 
matrix and an optimal sum of squares representor 0. 

(6.4) Myh = 0. 

Secondly, an equation "balanced" between primal and dual is 

tr(M y n) = L y (f) - A. 

We will continue this line of thought in a separate article. 

6.3. Engineering. For nonlinear systems, sum of squares techniques can be 
used to find Lyapunov functions by direct computation. Here the problem 
is to check if a differential equation 

ft = a(x) 

on M. 9 is stable. The most common technique is to seek a function V > 
except V(0) = satisfying the differential inequality 

W(x) • a{x) < for all x in R 9 ; 

such V are called Lyapunov functions . If a is a vector field with polynomial 
entries, it is natural to seek V which is a sum of squares of polynomials, 
and this reduces to a semi-definite program. Solution can be attempted 
numerically and if successful produces V a Lyapunov function; if not one 
can modify the sum of squares from polynomials to some rational sum of 
squares and try again, see [ParTh csis] . Such techniques lay out numerical 
solutions to midsized (dimension 8 or so) nonlinear control problems. 

More generally for control problems one seeks to find a feedback law 
u = k(x) which stabilizes 

^ = a(x) + b(x)u. 
at 
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Here there is a function V, beautifully tamed by E. Sontag, called a "control 
Lyapunov function" generalizing the classical Lyapunov function. Unfortu- 
nately, no Positivstellensatz technique is known for for finding V . However, 
A. Rantzer cleverly introduced a "dual control Lyapunov function" and in 
[PPR04J showed that it is quite amenable to sum of squares techniques. 

Another direction gives a generalization of the classical S-procedure which 
finds performance bounds on broad classes of problems, see [ParThesis] . 
There are also applications to combinatorial problems described there. 

Recently, [HL06] have given a technique for converting system engineering 
problems to polynomial minimization. The wide scope of the technique is 
very appealing. 

7. Linear matrix inequalities and computation of sums of 



Numerical computation of a sum of squares and a Positivstellensatz is 
based on a revolution which started about 20 years ago in optimization; the 
rise of interior point methods. We avoid delving into yet another topic but 
mention the special aspects concerning us. Thanks to the work of Nesterov 
and Nemirovskii in the early 1990s one can solve Linear Matrix Inequali- 
ties (LMIs in short) numerically using interior point optimization methods, 
called semi- definite programming . An LMI is an inequality of the form 



where the Aj are symmetric matrices and the numerical goal is to compute 
x £ M s satisfying this. The sizes of matrix unknowns treatable by year 
2006 solvers exceed 100 x 100; with special structure dimensions can go 
much higher. This is remarkable because our LMI above has about 5000g 
unknowns. 

7.1. SOS and LMIs. Sum of squares and Positivstellensatze problems con- 
vert readily to LMIs and these provide an effective solution for polynomials 
having modest number of terms. These applications make efficiencies in 
numerics a high priority. This involves shrewd use of semi-algebraic theory 
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and computational ideas to produce a semi-definite programming package, 
for a recent paper see pQ; also there is recent work of L. Vandenberghe. 
Semi-algebraic geometry packages are: SOS tools [PPSP04J and GloptiPoly 
[HL03] . 

A lament is that all current computational semi- algebraic geometry projects 
use a packaged semi-definite solver, none write their own. This limits effi- 
ciencies for sum of squares computation. 

Special structure leads to great computational improvement as well as 
elegant mathematics. For example, polynomials which are invariant under 
a group action, the delight of classical invariant theory, succumb to rapid 
computation, see |GP04j [CKSprcpt|. 

7.2. LMIs and the world. LMIs have a life extending far beyond compu- 
tational sum of squares and are being found in many areas of science. Later 
in this paper §101 we shall glimpse at their use in systems engineering, a use 
preceding sum of squares applications by 10 years. The list of other areas 
includes statistics, chemistry, quantum computation together with more; all 
to vast for us to attempt description. 

A paradigm mathematical question here is: 



Which convex sets C in M 9 with algebraic boundary can be represented 
with some monic LMI? 



That is, 



C = {x£R 9 : 1 + AtX! + ■■■ A g x g > 0}, 



where Aj are symmetric matrices. Here we have assumed the normalization 
G C. This question was raised by Parrilo and Sturmfels |P S03j . The 
paper |HVprept| gives an obvious necessary condition □ on C for an LMI 
representation to exist and proves sufficiency when g = 2. 



This is in contrast to the free algebra case where all evidence (like that in this paper) 
indicates that convexity is the only condition required. 
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The main issue is that of determinantal representations of a polynomial 
p(x) on R 5 , namely, given p express it in the form 



(7.2) p(x) = det(A + A\x\ H A g x 



a)- 



That this is possible for some matrices is due to the computer scientist Leslie 
Valiant [Val79]. That the matrices can be taken real and symmetric is in 
|HMVprept"1 as is the fact the a representation of det p(X) always holds 
for polynomials in non-commuting (free) variables, as later appear in £JHJ A 
symbolic computer algorithm due to N. Slinglend and implemented by J. 
Shoppie runs under the Mathematica package NCAlgebra. 

The open question is which polynomials can we represent monicaly; that 
is with Aq = I. Obviously, necessary is the real zero condition , namely, 

the polynomial f(t) := p(tx) in one complex variable t 
has only real zeroes, 

but what about the converse? When g = 2 the real zero condition on p 
insures that it has a monic representation; this is the core of |HVprept|. 



What about higher dimensions? Lewis, Parrilo and Ramana [LPR05 
showed that this g = 2 result (together with a counterexample they con- 
cocted) settles a 1958 conjecture of Peter Lax, which leads to the surmise 
that sorting out the g > 2 situation may not happen soon. Leonid Gurvitz 
pointed out the Valient connection to functional analysts and evangelizes 
that monic representations have strong implications for lowering the com- 
plexity of certain polynomial computations. 

8. NON-COMMUTATIVE ALGEBRAS 

A direction in semi-algebraic geometry, recently blossoming still with 
many avenues to explore, concerns variables which do not commute. As 
of today versions of the strict Positivstellensatze we saw in §5.31 are proved 
for a free *- algebra and for the enveloping algebra of a Lie algebra; here 
the structure is cleaner or the same as in the classical commutative theory. 
The verdict so far on noncommutative Nullstellensatze is mixed. In a free 
algebra it goes through so smoothly that no radical ideal is required. This 



60 J. WILLIAM HELTON AND MIHAI PUTINAR 

leaves us short of the remarkable perfection we see in the Stengle -Tarski - 
Seidenberg commutative landscape. Readers will be overjoyed to hear that 
the proofs needed above are mostly known to them already: just as in earlier 
sections, non- negative functionals on the sums of squares cone in a *-algebra 
can be put in correspondence with tuples of non-commuting operators, and 
this carries most of the day. 

This noncommutative semi-algebraic foundation underlies a rigid struc- 
ture (at least) for free *-algebras which has recently become visible. A 
noncommutative polynomial p has second derivative p" which is again a 
polynomial and if p" is positive, then our forthcoming free *-algebra Posi- 
tivstellensatz tells us that p" is a sum of squares. It is a bizarre twist that 
this and the derivative structure are incompatible, so together imply that a 
"convex polynomial" in a free *- algebra has degree 2 or less; see $3 The 
authors suspect that this is a harbinger of a very rigid structure in a free 
*-algebra for "irreducible varieties" whose curvature is either nearly positive 
or nearly negative; but this is a tale for another (likely distant) day. Some 
of the material in this section on higher derivatives and the next is new. 

A final topic on semi-algebraic geometry in a free *- algebra is appli- 
cations to engineering, §101 Arguably the main practical development in 
systems and control through the 1990's was the reduction of linear systems 
problems to Linear Matrix Inequalities, LMIs. For theory and numerics to 
be highly successful something called "Convex Matrix Inequalities" , hence- 
forth denoted in short CMIs, will do nicely. Most experts would guess that 
the class of problems treatable with CMIs is much broader than with LMIs. 
But no, as we soon see, our draconian free * convexity theorems suggest that 
for systems problems fully characterized by performance criteria based on 
L 2 and signal flow diagrams (as are most textbook classics), convex matrix 
inequalities give no greater generality than LMIs. 

These systems problems have the key feature that their statement does 
not depend on the dimension of the systems involved. Thus we summarize 
our main engineering contention: 

Dimension free convex problems are equivalent to an LMI 
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This and the next sections tells the story we just described but there 
is a lot it does not do. Our focus in this paper has been on inequalities, 
where various noncommutative equalities are of course a special and often 
well developed case. For example, algebraic geometry based on the Weyl 
algebra and corresponding computer algebra implementations, for example, 
Grobner basis generators for the Weyl algebra are in the standard computer 
algebra packages such as Plural/Singular. 

A very different and elegant area is that of rings with a polynomial iden- 
tity, in short PI rings , e.g. N x N matrices for fixed N. While most PI 
research concerns identities, there is one line of work on polynomial inequal- 
ities, indeed sums of squares, by Procesi-Schacher [PS76j . A Nullstellensatz 
for PI rings is discussed in |Amit57j . 

8.1. Sums of squares in a free *-algebra. Let M.(x,x*) denote the poly- 
nomials with real numbers as coefficients in variables x\, ...,x g ,x\, ...,x*. 
These variables do not commute, indeed they are free of constraints other 
than * being an anti-linear involution: 

(/<?)* = Q*r, &)* = 4 

Thus M.{x,x*) is called the real free *— algebra on generators x, x*. 

Folklore has it that analysis in a free *-algebra gives results like ordinary 
commutative analysis in one variable. The SoS phenomenon we describe in 
this section is consistent with this picture, but convexity properties in the 
next section do not. Convexity in a free algebra is much more rigid. 

We invite those who work in a free algebra (or their students) to try 
NCAlgebra, the free free-* algebra computer package [HSM05J. Calculations 
with it had a profound impact on the results in £}8]and[9l it is a very powerful 
tool. 

The cone of sums of squares is the convex hull: 



E 2 = co{/7; feWL(x,x*)}. 
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A linear functional L £ M.(x,x*)' satisfying L| S 2 > produces a positive 
semidefinite bilinear form 

(f,q)=L(q*f) 

onl(i,i*). We use the same construction introduced in section [3l namely, 
mod out the null space of (/, /) and denote the Hilbert space completion by 
H, with T> the dense subspace of H generated by M{x,x*). The separable 
Hilbert space H carries the multiplication operators Mj : T> — ► T>: 

Mjf = Xj f, feV, l<j<n. 

One verifies from the definition that each Mj is well defined and 

(M d f, q) = ( Xj f, q) = (f, x*q), f,q€V. 

Thus M? = M x *. The vector 1 is still *-cyclic, in the sense that the linear 
span V p £B,{x,x*)P(M, M*)l is dense in H. Thus, mutatis mutandis, we have 
obtained the following result. 

Lemma 8.1. There exists a bijective correspondence between positive linear 
Junctionals, namely 

LeR(x,x*)' and L| S 2 > 0, 

and g-tuples of unbounded linear operators T with a star cyclic vector 
established by the formula 

L(f) = {f(T,T*)Z,0, feR(x,x*). 

We stress that the above operators do not commute, and might be un- 
bounded. The calculus f(T, T*) is the non-commutative functional calculus: 
Xj (T)=Tj, x*(T)=T*. 

An important feature of the above correspondence is that it can be re- 
stricted by the degree filtration. Specifically, let M(x,x*)fc = {/; degf < k], 
and similarly, for a quadratic form L as in the lemma, let denote the 
finite dimensional subspace of H generated by the elements of M(x,x*)fc. 
Define also 

T? k = Z 2 nR(x,x*) k . 
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Start with a functional L E M.(x,x*)' 2k satisfying -^Is2 fc > 0. One can 
still construct a finite dimensional Hilbert space H, as the completion of 
M.(x, x*)k with respect to the inner product (/, q) = L(q*f), f,q£ M(x, x*)k- 
The multipliers 

Mj : P fc _x — » ff, Mj/ = Xj f, 

are well defined and can be extended by zero to the whole H. Let 

C2n) k+1 — 1 

N(k) = dimIR<x, x*) k = 1 + (2g) + (2 ff ) 2 + ... + (2g) k = ^ — . 

2g- 1 

In short, we have proved the following specialization of the main Lemma. 

Lemma 8.2. Let L G M(2;, x*)^ satisfy L\-gi > 0. There exists a Hilbert 
space of dimension N{k) and an g-tuple of linear operators M on H , with 
a distinguished vector £ € H, such that 



U) L(p) = {p(M,M*)t,0, 



2fc-2- 



Following the pattern of the preceding section, we will derive now a Nicht- 
negativstellensatz . 

Theorem 8.3 ([H02]). Let p £ ~R{x,x*) ( i be a non-commutative polynomial 
satisfying p(M,AI*) > for all g-tuples of linear operators M acting on a 
Hilbert space of dimension at most N(k), 2k > d + 2. Then p E X 2 . 

Proof. The only necessary technical result we need is the closedness of the 
cone S| in the Euclidean topology of the finite dimensional space M.(x, x*)t- 
This is done as in the commutative case, using Carathedodory's convex hull 
theorem. More exactly, every element of E 2 , is a convex combination of at 
most dimIR(x, x*)^ + 1 elements, and on the other hand there are finitely 
many positive functionals on X 2 , which separate the points of M.(x, x*)k- See 
for details |HMP04a| . 

Assume that p S 2 and let k > (d + 2)/2, so that p E R(x, x*)2k-2- Once 
we know that S|. is a closed cone, we can invoke Minkowski separation 
theorem and find a functional L E M{x, x*)' 2k providing the strict separation: 

L{p) < < L(f), /e4 
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According to Lemma 18.21 there exists a tuple M of operators acting on a 
Hilbert space H of dimension N(k) and a vector £ £ if, such that 

O<(KM,M*)£,0 = L(p)<O, 

a contradiction. ■ 

When compared to the commutative framework, this theorem is stronger 
in the sense that it does not assume a strict positivity of p on a well chosen 
"spectrum". Variants with supports (for instance for spherical tuples M : 
MfMx + ... + M*M g < I) of the above result are discussed in |HMP04a| . 

We state below an illustrative and generic result, from [HM04a], for sums 
of squares decompositions in a free *-algebra. 

Theorem 8.4. Let p E x*) and let q = {q%, qt} C WL(x, x*) be a set 
of polynomials, so that the non- commutative quadratic module 

QM(q) = co{f*q k f; f G R(x,x*}, < i < k}, q = I, 

contains 1 If for all tuples of linear bounded Hilbert 

space operators X = (X±, ...,X g ) subject to the conditions 

qi(X,X*) > 0, 1 < i < k, 

we have 

p(X,X*)>0, 

then p £ QM(q). 

Notice that the above theorem covers relations of the form r(X, X*) = 0, 
the latter being assured by ±r £ QM(q). For instance we can assume that 
we evaluate only on commuting tuples of operators, in which situation all 
commutators included among the (possibly other) generators of 

QM(q). 

Some interpretation is needed in degenerate cases, such as those where no 
bounded operators satisfy the relations qi(X,X*) > 0, for example, if some 
of qi are the defining relations for the Weyl algebra; in this case, we would 
say p(X,X*) > 0, since there are no X. Indeed p G QM(q) as the theorem 
says. 
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Proof Assume that p does not belong to the convex cone QM(q). Since the 
latter is archimedean, by the same Minkovski principle there exists a linear 
functional L G M.(x,x*}', such that 

L(p)<0<L(f), feQM(q). 

Define the Hilbert space H associated to L, and remark that the left mul- 
tipliers M Xi on M(x,x*) give rise to linear bounded operators (denoted by 
the same symbols) on H. Then 

ft(M,M*)>0, l<i<k, 

by construction, and 

(p(M,M*)l,l)=L(p)<0, 

a contradiction. 

The above statement allows a variety of specialization to quotient alge- 
bras. Specifically, if I denotes a bilateral ideal of R(x,x*}, then one can 
replace the quadratic module in the statement with QM(q) + /, and sepa- 
rate the latter convex cone from the potential positive element on the set of 
tuples of matrices X satisfying simultaneously 

qi (x,x*)>o, o<i<k, f(x) = o, fei. 

For instance, the next simple observation can also be deduced from the 
preceding theorem. 

Corollary 8.5. Let J be the bilateral ideal ofR(x,x*} generated by the 
commutator polynomial [x\ + x\,X2 + ^2] — 1- Then J + QM(1 — x\x\ — ... — 
x*x g ) = R(x, x*). 

Proof Assume by contradiction that J+QM(1 X ^ X ^ • • • X g X g ) 7^ x*). 
By our basic separation lemma, there exists a linear functional L G M(x,x*)' 
with the properties: 

Lj+QM(i~x* 1 x 1 -...-x*x g ) > 0, and L(l) > 0. 
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Then the GNS construction will produce a tuple of linear bounded op- 
erators X, acting on the associated non-zero Hilbert space H, satisfying 
X*X 1 + ... + X*X g <Iand 

[Xl+X u X* 2 +X 2 }=I. 

The latter equation is however impossible, because the left hand side is 
anti-symmetric while the right hand side is symmetric and non-zero. 
■ 

Similarly, we can derive following the same scheme the next result. 

Corollary 8.6. Assume, in the condition of the above Theorem, thatp(X,X*) > 
for all commuting tuples X of matrices subject to the positivity con- 
straints qi(X,X*) > 0, < i < k. Then 

p e QM{q) + I, 

where I is the bilateral ideal generated by all commutators [xi,xj], [xi,xj]* , 1 < 
i,j < 9- 

With similar techniques (well chosen, separating, ^representations of the 
free algebra) one can prove a series of Nullstellensatze. We state for infor- 
mation one of them, see for an early version [HMP04bJ. 

Theorem 8.7. Letpi(x), ...,p m (x) £ M(x) be polynomials not depending on 
the Xj variables and let q(x,x*) G K(x,x*). Assume that for every g tuple 
X of linear operators acting on a finite dimensional Hilbert space H , and 
every vector v £ H , we have: 

( Pj (X)v = 0, l<j<m) => (q(X,X*)v = 0). 

Then q belongs to the left ideal M{x,x*)pi + ... + x*)p m . 

Again, this proposition is stronger than its commutative counterpart. For 
instance there is no need of taking higher powers of q, or of adding a sum 
of squares to q. 
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We refer the reader to |HMP06j for the proof of Proposition [821 However, 
we say a few words about the intuition behind it. We are assuming 

Pj (X)v = 0,Vj q(X,X*)v = 0. 

On a very large vector space if X is determined on a small number of vectors, 
then X* is not heavily constrained; it is almost like being able to take X* 
to be a completely independent tuple Y. If it were independent, we would 
have 

Pj (X)v = 0,Vj q(X,Y)v = 0. 

Now, in the free algebra M.{x,y), it is much simpler to prove that this 
implies q G pj, as required. We isolate this fact in a separate 

lemma. 

Lemma 8.8. Fix a finite collection p±, ...,p m of polynomials in non-commuting 
variables {xi, . . . , x g } and let q be a given polynomial in {x\, . . . , x g }. Let d 
denote the maximum of the deg(g) and {deg(pj) : 1 < j < m}. 

There exists a real Hilbert space 7i of dimension X] j=o 9^ > suc ^ that, if 

q(X)v = 

whenever X = (X±, . . . , X g ) is a tuple of operators on TL, v G TL, and 

Pj(X)v = for all j, 
then q is in the left ideal generated by P \, ...,p m . 

Proof (of Lemma). We sketch a proof based on an idea of G. Bergman, see 
[HM04a| . 

Let I be the left ideal generated by pi, ...,p m in F = x g ). Define 

V to be the vector space F/T and denote by [/] the equivalence class of 
/ G F in the quotient F/I. 

Define Xj on the vector space F/Z by Xj[f] = [xjf] for / G F, so that 
Xj i — > Xj implements a quotient of the left regular representation of the free 
algebra F. 
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If V := F/I is finite dimensional, then the linear operators X = (X\ , . . . , X g ) 
acting on it can be viewed as a tuple of matrices and we have, for f £ F, 

f(X)[l] = [/]• 

In particular, = for all j. If we do not worry about the dimension 

counts, by assumption, = g(X)[l], so = [q] and therefore q £ I. Minus 
the precise statement about the dimension of Ti this establishes the result 
when F/I is finite dimensional. 

Now we treat the general case where we do not assume finite dimension- 
ality of the quotient. Let V and W denote the vector spaces 

V :={[/]: f€F, deg(/)<4, 

W :={[/]: f£F, deg(/)<d-l}. 

Note that the dimension of V is at most Yl j=o ^ ■ We define Xj on W to 
be multiplication by Xj. It maps W into V. Any linear extension of Xj to 
the whole V will satisfy: if / has degree at most d, then /(X)[l] = [/]. The 
proof now proceeds just as in the part 1 of the proof above. ■ 

With this observation we can return and finish the proof of Theorem 18.71 
Since X* is dependent on X, an operator extension with properties stated in 
the lemma below gives just enough structure to make the above free algebra 
Nullstellensatz apply; and we prevail. 

Lemma 8.9. Letx = {x%, . . . ,x m }, y = {y\, . . . ,y m } be free, non- commuting 
variables. Let H be a finite dimensional Hilbert space, and let X, Y be two 
m-tuples of linear operators acting on H . Fix a degree d > 1. 

Then there exists a larger Hilbert space K D H , an m-tuple of linear 
transformations X acting on K , such that 

X j \ H = X j , l<j<g, 

and for every polynomial q £ x*) of degree at most d and vector v £ H , 



q(X,X*)v = q(X,Y)v = 0. 
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For the matrical construction in the proof see [HMP06]. 



We end this subsection with an example, see [HM04aJ 



Example 8.10. Let p = (x*x + xx*) 2 and q = x + x* where x is a single 
variable. Then, for every matrix X and vector v (belonging to the space 
where X acts), p(X)v = implies q(X)v = 0; however, there does not exist 
a positive integer m and r, r» G M{x, x*), so that 



$.2 



,2m 



+ r*rj = pr + r*p. 



Moreover, we can modify the example to add the condition p(X) is positive 
semi-definite implies q(X) is positive semi-definite and still not obtain this 
representation. □ 

Proof Since A := XX* + X*X is self-adjoint, A 2 v = if and only if 
Av = 0. It now follows that if p(X)v = 0, then Xv = = X*v and 
therefore q(X)v = 0. 
For A G M, let 

/o A o\ 
X = X(A) =001 
\0 0y 

viewed as an operator on M 3 and let u = ei, where {ei, e2, 63} is the standard 
basis for R 3 . 

We begin by calculating the first component of even powers of the matrix 
q(X). Let Q = q(X) 2 and verify, 

/a 2 a\ 



5.3) 



Q 



1 + A 2 
\A 1J 

For each positive integ 6r TTi tliGrG 6xist & polynomial q m so that 

f\ 2 (l + \q m (\))\ 
(8.4) Q m e 1 = 

\ A(l + \q m {\)) ) 
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which we now establish by an induction argument. In the case m = 1, from 
equation (|8.3p . it is evident that q\ = 0. Now suppose equation (|8.4p holds 
for m. Then, a computation of QQ m e\ shows that equation (|8.4|) holds for 
m + 1 with g m +i = X(q m + A + Xq m ). Thus, for any m, 
1 



1.5) 



lim 



A-To A 2 
Now we look at p and get 



< Q m ei,ei >= lim(l + A 9m (A)) = 1. 

A — >0 



/a 4 







(1 + A 2 ) 2 






V 



Thus 



lim (< r(Xy P (X) ei , ei > + < p(X)r(A>i , e x >) = 0. 

A^O A z 

If the representation of equation (|8,2p holds, then apply < • e\,e\ > to 
both sides and take A to 0. We just saw that the right side is 0, so the left 
side is 0, which because 



<J2r j (X)*r j (X)e 1 ,e 1 >>0 



lim — < Q m e 1 ,e 1 > < 
A^0 A 2 



forces 



a contradiction to equation ( 18.51 ). Hence the representation of equation 
(j8T2j) does not hold. 

The last sentence claimed in the example is true when we use the same 
polynomial p and replace q with q 2 . ■ 

There are more Positivstellensatze in a free *-algebra which fill in more 
of the picture. The techniques proving them are not vastly beyond what 
we illustrated here. For example, Klep-Schweighofer [KS05] do an analog 
of Stengle's Theorem 15.3( a). while Theorem 13.91 is faithfully made free in 
[HM04a]. In spite of the above results we are still far from having a full 
understanding (a la Stengle's Theorem) of the Null- and Positiv-stellensatze 
phenomena in the free algebra. 



POSITIVE POLYNOMIALS AND THE SPECTRAL THEOREM 71 

8.2. The Weyl algebra. Weyl's algebra, that is the enveloping algebra 
of the Heisenberg group is interesting because, by a deep result of Stone- 
von Neumann, it has a single irreducible representation; and that is infinite 
dimensional. Thus, to check on the spectrum the positivity of an element, 
one has to do it at a single point. The details were revealed by Schmiidgen 
in a very recent article [S05| . We reproduce from his work the main result. 

Fix a positive integer g and consider the unital *-algebra W(g) generated 
by 2g self-adjoint elements pi, ...,p g , qi, q g , subject to the commutation 
relations: 

\Pk,Qj] = -$kj(i- 1)) \pk,Pj] = [lj,<lk] = 0, l<j,k<g. 

The unique irreducible representation of this algebra is given by the partial 
differential operators 

Hpk)f = -»/, = x k f, 

dx k 

acting on Schwartz space Via this representation, the elements of 

W(g) are identified with linear partial differential operators with polynomial 
coefficients (in g variables). These operators can be regarded as densely 
defined, closed graph operators from to L 2 (M. 9 ). The set 

W(g) + ={fe W{g); <$(/)£, > 0, £ e «S(M»)} 

consists of all symmetric, non-negative elements, with respect to the repre- 
sentation 
Define 

Qk + iPk Qk ~ ipk 

so that = a_&. Fix a positive number a which is not an integer, and let 

N = a*ai + ... + a*a g ; 

denote by N the set of all finite products of elements N + (a + n)l, with 
n G Z. 

The algebra W(g) carries a natural degree, defined on generators as 

deg(a fc ) = deg(a_ fe ) = 1. 
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Every element / G W(g) can be decomposed into homogeneous parts f s of 
degree s: 

f = fm + fm-l + ••• + /o- 

We can regard fk as a homogeneous polynomial of degree k, in the variables 

a±i, ...,a± g . The principal symbol of / is the polynomial 

f m (zi, z g ,zi, ...,z g ), where chj was substituted by Zk and a_fc by zj^. 

Theorem 8.11. [S05] Let f E W(g) be a self-adjoint element of even degree 
2m, and let P(z,z) be its principal symbol. If 

a) . There exists e > such that f — e • 1 € W(g)+, 

b) . P(z,z) > for z / 0, 

then, if m is even there exists b G N such that bfb G X 2 iy(g); i/m is odd, 
there exists b £ J\f such that Ylj=i bajfa^jb £ T, 2 W(g). 

For examples and details see [S05]. 

Already mentioned and annotated was our serious omission of any de- 
scription of the Nullstellensatz in a Weyl Algebra. 

8.3. Sums of squares modulo cyclic equivalence. A still open, impor- 
tant conjecture in the classification theory of von Neumann algebras was 
recently reduced by F. Radulescu to an asymptotic Positivstellensatz in the 
free algebra. We reproduce from his preprint |Radul04] the main result. 
We do not explain below the standard terminology related to von Neumann 
algebras, see for instance |Tak02] . 

The following conjecture was proposed thirty years ago in |Connes76] : 

Every type ll\ factor can be embedded into an ultraproduct of the hyper- 
finite factor. 

There are presently quite a few reformulations or reductions of this con- 
jecture. The one of interest for this survey can be formulated as follows. 
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Let F = C(x\, ■■■■ > Xg) be the free algebra with anti-linear involution x* = 
Xj, 1 < j < g. We complete F to the algebra of convergent series 

F = a w w; KI?* H < oo, Vr > 0}, 

w w 

where w runs over all words in F and o m 6 C. The resulting Frechet space 
F carries a natural weak topology denoted cr(F, F*). 

A trace r in a von-Neumann algebra M is a linear functional which has 
by definition the cyclic invariant property T(a\...a n ) = t ((12(13... a n a\). Two 
series /i,/2 S F are called cyclically equivalent if f\ — fi is the weak limit 
of a linear combination of elements w — w', where w £ F is a word and u/ 
is a cyclic permutation of it. 

The following asymptotic Positivstellensatz holds. 

Theorem 8.12. [Radul04] Let f G F be a symmetric series with the prop- 
erty that for every separable, type II\ von Neumann algebra (M, r) and 
every g-tuple of self- adjoint elements X of M we have r(f(X)) > 0. Then 
f is cyclically equivalent to a weak limit of sums of squares s n , s n S T, 2 F. 

It is not known whether one can replace the test 1I\ algebras by finite 
dimensional algebras, but an answer to this querry would solve Connes con- 
jecture. 

Corollary 8.13. Connes embedding conjecture holds if and only if for every 
symmetric element f € F the following assertion holds: 

f is cyclically equivalent to a weak limit of sums of squares s n , s n 
if and only if for any positive integer d and g-tuple of self-adjoint d x d 
matrices X one has trace/ (X) > 0. 

The proofs of Radulescu's theorem and the corollary follow the same 
pattern we are by now familiar with: a convex separation argument followed 
by a GNS construction. See for details [Radul04] . and for a last minute 
refinement [KS06]. 
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9. Convexity in a free algebra 

Convexity of functions, domains and their close relative, positive curva- 
ture of varieties, are very natural notions in a *-free algebra. A shocking 
thing happens: these convex functions are so rare as to be almost trivial. 
This section illustrates a simple case, that of convex polynomials, and we 
see how in a free algebra the Nichtnegativtellensatze have extremely strong 
consequences for inequalities on derivatives. The phenomenon has direct 
qualitative consequences for systems engineering as we see in §101 The re- 
sults of this section can be read independently of all but a few definitions in 
and the proofs require only a light reading of it. 

This time M.{x) denotes the free *-algebra in indeterminates x = (x%, x g ), 
over the real field. There is an involution x*j = Xj which reverses the order of 
multiplication (fp)* = p* f* ■ In this exposition we take symmetric variables 
Xj = x*j, but in the literature we are summarizing typically Xj can be taken 
either free or symmetric with no change in the conclusion, for example, the 
results also hold for symmetric polynomials in M(x,x*). 

A symmetric polynomial p,p* = p, is matrix convex if for each positive 
integer n, each pair of tuples X = (X\, . . . , X g ) and Y = (Yi, . . . , Y g ) of 
symmetric n x n matrices, and each < t < 1, 

(9.1) p(tX + (1 - t)Y) < tp(X) + (1 - t)p{Y). 

Even in one-variable, convexity in the noncommutative setting differs from 
convexity in the commuting case because here Y need not commute with X. 
For example, to see that the polynomial p = x 4 is not matrix convex, let 

/42\ /2 0\ 

X = and Y = 

V 2 2 ; V o o ; 

and compute 

2 2 4 2 ' ^ 120 84 J 
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which is not positive semi-definite. On the other hand, to verify that x 2 is 
a matrix convex polynomial, observe that 

tX 2 + (1 - t)Y 2 - (tX + (1 - t)Y) 2 

= t(l - t){X 2 -XY -YX + Y 2 ) = t(l - t)(X - Y) 2 > 0. 

Theorem 9.1. [HM04b] Every convex symmetric polynomial in the free 
algebra R(x) or M.(x,x*) has degree two or less. 

As we shall see convexity of p is equivalent to its "second directional de- 
rivative" being a positive polynomial. As a matter of fact, the phenomenon 
has nothing to do with order two derivatives and the extension of this to 
polynomials with k th derivative nonnegative is given later in Theorem 19.41 

Yet stronger about convexity is the next local implies global theorem. 

Let V denote a collection of symmetric polynomials in non-commutative 
variables x = {x\,--- ,x g }. Define the matrix nonnegativity domain T>-p 
associated to V to be the set of tuples X = {X\, ■ ■ ■ ,X g ) of finite dimen- 
sional real matrices of all sizes, except dimensions, making p(X\, • • • , X g ) 
a positive semi-definite matrix. 

Theorem 9.2. [HM04bJ Suppose th ere is a set V of symmetric polyno- 
mials, whose matrix nonnegativity domain Dp contains open sets in all 
large enough dimensions. Then every symmetric polynomial p in M.{x) or in 
WL(x, x*) which is matrix convex on D-p has degree two or less. 

The first convexity theorem follows from Theorem 18.31 an d we outline 
below the main ideas in its proof. The proof of the more general, order k 
derivative, is similar and we will return to it later in this section. The proof 
of Theorem 19.21 requires different machinery (like that behind representation 
(jl0.5l )) and is not presented here. 

At this point we describe a bit of history. In the beginning was Karl 
Lowner who studied a class of real analytic functions in one real variable 
called matrix monotone, which we shall not define here. Lowner gave in- 
tegral representations and these have developed beautifully over the years. 
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The impact on our story comes a few years later when Lowner's student 
Klaus |K36| introduced matrix convex functions / in one variable. Such a 
function / on [0, oo] C K can be represented as f(t) = tg(t) with g ma- 
trix monotone, so the representations for g produce representations for /. 
Modern references are IQSTprept] , [U02] . Frank Hansen has extensive deep 
work on matrix convex an monotone functions whose definition in several 
variables is different than the one we use here, see |HanT06j ; for a recent 
reference see |Han97| . 

For a polynomial p G R(x) define the directional derivative: 

p'{x)[h] = j t p(x + th) ]t ^. 

It is a linear form in h. Similarly, the k th derivative 

d^ 

P {k \x)[h] = -^p(x + th)\ t=0 

is homogeneous of degree k in h. 

More formally, we regard the directional derivative p'(x)[h] £ M.{x,h) 
as a polynomial in 2g free symmetric (i.e. invariant under *) variables 
(xx, . . . , x g , hi, . . . , h g ); In the case of a word w = Xj 1 Xj 2 ---Xj n the de- 
rivative is: 

w [h] = hj 1 Xj 2 ■ ■ ■ Xj n + Xj 1 hj 2 xj 3 ■ ■ ■ xj n + . . . + Xj x • • • xj n _ 1 hj n 

and for a polynomial p = p'(x)[h] = ^2p w w the derivative is 

P'( x )[h] = ^2p w w'[h}. 

If p is symmetric, then so is p'. 

For g-tuples of symmetric matrices of a fixed size X, H, observe that the 
evaluation formula 

holds. Alternately, with q(t) = p{X + tH), we find. 

p'(X)[H]=q'(0), 

Likewise for a polynomial p G M{x), the Hessian p"(x)[h] of p(x) can be 
thought of as the formal second directional derivative of p in the "direction" 
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h. Equivalently, the Hessian of p{x) can also be denned as the part of the 
polynomial 

r(x)[/i] := p(x + h) — p(x) 

in the free algebra in the symmetric variables that is homogeneous of degree 
two in h. 

If p" 7^ 0, that is, if degree p > 2, then the degree of p"(x)[h] as a polyno- 
mial in the 2g variables x\, . . . , x g , h± . . . , h g is equal to the degree of p(x) 
as a polynomial in x±, . . . , x g . 

Likewise for k th derivatives. 

Example 9.3. 1. p(x) = X2X1X2 

p'(x)[h] = "^[(^2 +th2)(xi +thl)(x2 +^2)]| t=0 = h 2 X\X2 + X 2 hiX2 + X2Xih 2 . 

2. One variable p(x) = x 4 . Then 

p'(x)[h] = hxxx + xhxx + xxhx + xxxh 

Note each term is linear in h and h replaces each occurrence of x once and 
only once: 

p"(x)[h] = 
hhxx + hhxx + hxhx + hxxh+ 
hxhx + xhhx + xhhx + xhxh+ 
hxxh + xhxh + xxhh + xxhh, 

which yields 

p"(x)[h] = 2hhxx + 2hxhx + 2hxxh + 2xhhx + 2xhxh + 2xxhh. 

Note each term is degree two in h and h replaces each pair of x's exactly 
once. Likewise 

p^(x)[h} = 6(hhhx + hhxh + hxhh + xhhh) 
and p^{x)[h] = 24hhhh and p^(x)[h] = 0. 

3. p = x\x 2 

p"(x)[h] = h\x 2 + h\x\h 2 + x\h\h 2 . 
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□ 

The definition of a convex polynomial can be easily adapted to domains. 
Then one remarks without difficulty that, in exact analogy with the commu- 
tative polynomial p is convex (in a domain) if and only if the Hessian 
evaluated at the respective points is non-negative definite. Because of this 
Theorem 19.11 is an immediate consequence of the next theorem restricted to 
k = 2. 

Theorem 9.4. Every symmetric polynomial p in the free algebra or 
M.(x, x*) whose k th derivative is a matrix positive polynomial has degree k 
or less. 

Proof (when the variables symmetric) . 

Assume p( k \x)[h] is a matrix positive polynomial, so that, in view of 
Theorem 18.31 we can write it as a sum of squares: 

p( k \x)[h] = j2fifn 

here each fj(x, h) is a polynomial in the free algebra M(x, h). 

If (x) [h] is identically equal to zero, then the statement follows. As- 
sume the contrary, so that p( k \x)[h] is homogeneous of degree k in h, and 
there are tuples of matrices X, H and a vector £ in the underlying finite 
dimensional Hilbert space, so that 

<p( fc )(A)[tf]£,£) >0. 

By multiplying H by a real scalar t we find 

t k {p^{X)[H%i) = (p( k \X)[tH]Z,0 > 0, 

whence k = 2// is an even integer. 

Since in a sum of squares the highest degree terms cannot cancel, the 
degree of each fj is at most v in x and fx in h, where 2v is the degree of p^ 
in x. 

Since p^ is a directional derivative, it must have a highest degree term 
of the form ■ ■ ■ hi k m(x) where the monomial m(x) has degree equal to 
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degree p^ — k; also is allowed to equal hi t . Thus some product, denote 
it fjfj, must contain such a term. (Note the the order of the h's vs. the 
x's matters.) This forces fj to have the form 

fj = ci(hi lt+1 ---h ik )m(x) + c 2 (hi 1 ■ ■ -h^) + 

the Cj being scalars. 

To finish the proof use that fjfj contains 

c 2 m(x)*(h ilt +1 • • • h ik )*(hi^ +1 • • • h ik )m{x) 

and this can not be cancelled out, so 

degp (fc) = k + 2(degp (fc) - k) = 2degp (fc) - k. 

Solve this to find degp^ = k. Thus p has degree k. ■ 

We use a previous example in order to illustrate this proof when k = 2. 

Example 9.5. Example p = x 4 is not matrix convex; here x = x* . 
Calculate that 

p [x)[h] = 2hhxx + 2hxhx + 2hxxh + 2xhhx + 2xhxh + 2xxhh. 

Up to positive constants some polynomial fjfj contains a term hhxx, so 
fj = hxx + h + . ... 

So fjfj contains xxhhxx. This is a highest order perfect square so can 
be cancelled out. Thus is appears in p", which as a consequence has degree 
6. This a contradiction. □ 

We call the readers attention to work which goes beyond what we have 
done in several directions. One |HMVprept| concerns a noncommutative 
rational function r and characterizes those which are convex near 0. It is 
an extremely small and rigidly behaved class, for example, r is convex on 
the entire component of the "domain of r" which contains 0. This rigidity 
is in analogy to convex polynomials on some "open set" having degree 2 
or less and this implying they are convex everywhere. Another direction is 
the classification of noncommutative polynomials whose Hessian p"(x)[h] at 
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most k "negative noncommutative eigenvalues" In |DHMprept| it is shown 
that this implies 

degp <2k + 2. 

Of course the special case we studied in this section is exactly that of poly- 
nomials with k = 0. 

10. Dimension free engineering: LMIs vs. CMIs 

This section demonstrates the need for real algebraic geometry (in the 
broad sense) aimed at convexity over a free or nearly free *- algebra. From 
this viewpoint the theory in this survey goes in an essential direction but 
much more is called for in order to do general engineering problems. Hope- 
fully the brief description in this section will give a little feel for where we 
now stand. We are aiming at one of the major issues in linear systems theory: 

Which problems convert to a convex matrix inequality, CMI? How does 
one do the conversion? 



To be in line with the engineering literature, we use below a slightly dif- 
ferent notation than the rest of the article. For instance A T denotes the 
transpose of a (real entries) matrix, and A T replaces in this case the invo- 
lution A* we have encountered in the previous sections. The inner product 
of vectors in a real Hilbert space will be denoted u ■ v. 

10.1. Linear systems. A linear system 5 is given by the linear differential 
equations 

dx . _ 
— = Ax + Bu 
dt 

y = Cx 

with the vector 

• x(t) at each time t being in the vector space X called the state space, 



u(t) at each time t being in the vector space U called the input space, 
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• y(t) at each time t being in the vector space y called the output 
space, 

and A, B, C being linear maps on the corresponding vector spaces. 

10.2. Connecting linear systems. Systems can be connected in incred- 
ibly complicated configurations. We describe a simple connection and this 
goes along way toward illustrating the general idea. Given two linear sys- 
tems ©, we describe the formulas for connecting them as follows. 



u 



+ 







y 


J- > 






t 















y 





Systems J and © are respectively given by the linear differential equations 



— = Ax + Be, 
dt 



t J-h 



y = Cx, v = c£. 

The connection diagram is equivalent to the algebraic statements 

w = y and e = u — v. 

The closed loop system is a new system whose differential equations are 

dx . „ . „ 
— = Ax- Bcf + Bu, 
dt 



dt 



In matrix form this is 



a£ + by = a£ + bCx, 
y = Cx. 



bC a U V J 



y = (CQ) 



x 
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where the state space of the closed loop systems is the direct sum l X © y 
of the state spaces X of # and y of &. The moral of the story is: 

System connections produce a new system whose coefficients are matrices 
with entries which are polynomials in the coefficients of the component sys- 
tems. 

Complicated signal flow diagrams give complicated matrices of polyno- 
mials. Note in what was said the dimensions of vector spaces and matrices 
never entered explicitly; the algebraic form of (jlO.ll ) is completely deter- 
mined by the flow diagram. We have coined the term dimension free for 
such problems. 

10.3. Energy dissipation. We have a system J and want a condition which 
checks whether 



holds for all input functions u, where $u = y in the above notation . If this 
holds 5 is called a dissipative system 



This is analysis but it converts to algebra because of the following con- 
struction. Hope there is a "potential energy" like function V > 0, V(0) = 0, 
on the state space; it should satisfy: 

potential energy now + energy in > potential energy then + energy out. 

In mathematical notation this is 
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and a V > 0, V(0) = 0, which satisfies this for all input functions u and 
initial states x(t\) is called a storage function. We can manipulate this inte- 
gral condition to obtain first a differential inequality and then an algebraic 
inequality, as follows: 

o> «^M + * /^OP-HOP, 

12 —II 12 — *1 Jti 

o > vy(xfe)) • J(t a ) + |^i)l 2 -Kii)l 2 - 

Use ^(ti) = Ac(ti) + B«(ti) to get 

> VF(x(t!))-(i4a:(ti)+flu(ti)) + |Ca;((ii))| 2 - K(*i)| 2 . 

The system is dissipative if and only if this holds for all u(t\), x(t\) which 
can occur when it runs (starting at x(0) = 0). All vectors u{t\) in IA can 
certainly occur as an input and if all x(t\) can occur we call the system 
reachable . Denote x(t\) by x and u(t±) by u 

(10.2) 0> VV(x) ■ (Ax + Bu) + \Cx\ 2 -\u\ 2 , 
and conclude: 

Theorem 10.1. The system A, B,C is dissipative if inequality \10.2\i holds 
for all u €U,x £ X. Conversely, if A,B,C is reachable, then dissipativity 
implies inequality 1110.2]) holds for all u GU,x S X . 

For a linear system we try V which is quadratic, so V(x) = Px ■ x with 
P > and VV(x) = 2Px. At this point there are two commonly pursued 
paths which constitute the next two subsections. 

10.3.1. Riccati inequalities. Use that W(x) = 2Wx in (|10.2p to get 

> 2Wx ■ {Ax + Bu) + \Cx\ 2 - \u\ 2 , for all u, x, 

so 

(10.3) > max([WA + A T W + C T C]x ■ x + 2B T Wx ■ u - \u\ 2 ). 
The maximizer in u is u = B T Wx, hence 

> 2Wx • Ax + 2\B T Wx\ 2 + \Cx\ 2 - \B T Wx\ 2 . 



8-1 
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Which in turn is 

> [WA + A T W + WBB T W + C T C]x ■ x. 

This is the classical Riccati matrix inequality 

0> WA + A T W + WBB T W + C T C 

which together with W > insures dissipativity and is also necessary for it 
when the system is reachable. 

10.3.2. Linear Matrix Inequalities (LMI). Alternatively we do not need to 
compute max u but can express (110.31) as the inequality: 



x \ x 



, WA + A T W + C T C WB \ . .. . . , ..- . 



B T W -I 
for all u G U, x S X. That is the following matrix inequality holds: 



WA + A T W + C T C WB 
B T W -I 



< 0. 



Here A, B, C describe the system and W is an unknown matrix. If the 
system is reachable, then A, B, C is dissipative if and only if L{W) < and 
W > 0. 

Recall that the Schur complement of a matrix is defined by 

(a P \ _i r 

:= a — /?7 B . 
f 7 J 

Suppose 7 is invertible. The matrix is positive semi-definite if and only if 
7 > and its Schur complement is positive semi-definite. Note that 

SchurComp L{W) = WA + A T W + WBB T W + C T C 

featuring the Riccati inequality we saw before. Indeed, L(W) < if and only 
if WA+A T W+WBB T W+C T C < 0, since this implies WA+A T W+C T C < 
0. Thus the Riccati approach and the LMI approach give equivalent answers. 
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10.4. Example: An H°° control problem. Here is a basic engineering 
problem: 

Make a given system dissipative by designing a feedback law. 
To be more specific, we are given a signal flow diagram: 



w 


Given 

A B\, £>2, Ci, C 2 
D 12 , D 21 


out 


u 


y 










Find 

a, b, c 











where the given system is 

ds „ „ 

— = As + Biw + B 2 u, 
dt 

out = C\s + D± 2 u + Duw, 

y = C 2 s + D 2 iw, 



D 21 = I, D 12 D' l2 = I, D' l2 D 12 =I, £>n = 0. 
We want to find an unknown system 

— = a£ + b, u = c£, 

called the controller, which makes the system dissipative over every finite 
horizon. Namely: 

T 

\w(t)\ 2 dt> J \out(t)\ 2 dt, s(0) = 0. 
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So a, b, c are the critical unknowns. 

10.4.1. Conversion to algebra. The dynamics of the "closed loop" system 
has the form 



dt 



";m;) +i * 

out = C J + Vw 

where A, B,C,T> are "2x2 block matrices" whose entries are polynomials 
in the A's, B's, ■ ■ ■ ,a,b,c etc. The storage function inequality which corre- 
sponds to energy dissipation has the form 

(10.4) H := A T E + EA + EBB 1 'E + C t C<0 

where E has the form 

(E\i E\2 \ y 
, E\2 = E 2l . 

E21 E22 J 

The algebra problem above in more detail is to solve inequality (|10.4p 

( H ss H sy \ 
H = I J < 0, H sy — H ys , 

V Hys E yy ) 
where the entries of H are the polynomials: 



H„ = En A + A 1 E u + C? d + E{ 2 b C 2 + C 2 J b 1 E( 2 + E n B 1 b 1 E( 2 + 
En Bi Bi En + E12 b b T E12 + E12 b B x En , 
H sz = E 2 iA+ ^ ( ^ 21 2 + E ^ +c t Ci+ E22 bC 2 + c T Bl E\\ + 
E21 Bi b T (E 2 i + Ef 2 ) T rp E 22 bb T (E 2 i + Ef 2 ) j> j, 

h-C/21 Bi B 1 h, xl -\ \-h122OD1 

H zs = A T Eti+C? c+ {El2+ 2 E ^ a +En B 2 c+C? b T E^+En B 1 b T E? 2 + 

rp r> nT rpT , Cglg + gfl ) b b T E 22 , ( E 12 + E 2l ) b B l E 21 

En Bi B x B 21 H - 1 , 

H zz = E 22 a + a T E% 2 + c T c + E 2X B 2 c + c T Bj E\ x + E 2X B x b T E% 2 + 
E 2 i Bi B\ E 2 i + E 22 b b T E 22 + £"22 b B x E21 ■ 



T iT rpT 



uT rpT 
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Here A, B\, B2, C±, C2 are known and the unknowns are a, b, c and for 
E\2, E21 and i?22- 

We very much wish that these inequalities are convex in the unknowns 
(so that numerical solutions will be reliable). But our key inequality above 
is not convex in the unknowns. 

10.4.2. The key question. Is there is a set of noncommutative convex in- 
equalities with an equivalent set of solutions? 

This is a question in algebra not in numerics and the answer after a lot 
of work is yes. The path to success is: 

(1) Firstly, one must eliminate unknowns, change variables and get a 
new set of inequalities K.. 

(2) Secondly, one must check that 1C is "convex" in the unknowns. 

This outline transcends our example and applies to very many situations. 
The second issue of this is becoming reasonably understood, for as we saw 
earlier, a convex polynomial with real coefficients has degree two or less, so 
these are trivial to identify. While the level of generality of the theory we 
have presented in this paper is less than we now require, to wit, polynomials 
with indeterminates as coefficients and matrices with polynomial entries; 
this does not add radically different structure, see discussion in ^1U. 51 The 
first issue, changing variables, is still a collection of isolated tricks, with 
which mathematical theory has not caught up. For the particular problem 
in our example we shall not derive the solution since it is long. However, we 
do state the classical answer in the next subsection. 

10.4.3. Solution to the Problem. The textbook solution is as follows, due to 
Doyle-Glover- Kargonekar- Francis. It appeared in [DGKF89] which won the 
1991 annual prize for the best paper to appear in an IEEE journal. Roughly 
speaking it was deemed the best paper in electrical engineering in that year. 
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We denote 

DGKFx := (A - B 2 C 1 )'X + X(A - B 2 Ci) + X(-f- 2 B 1 B[ - B 2 l B' 2 )X 

DGKF Y := A X Y + YA X> + Y^C^Cx - C' 2 C 2 )Y, 
where A x := A - B X C 2 . 

Theorem 10.2. [DGKF89] There is a system solving the control problem 
if there exist solutions 

X > and Y > 

to inequalities the 

DGKF Y < and DGKFx < 
which satisfy the coupling condition 

X-Y- 1 < 0. 

This is if and only if provided Y > is replaced by Y > and Y^ 1 is 
interpreted correctly. 

This set of inequalities while not usually convex in X, Y are convex in the 
new variables W = X^ 1 and Z = Y^ 1 , since DGKFx and DGKFy are lin- 
ear in them and X— Y~ l = W^ 1 — Z has second derivative 2W~ X HW' 1 HW^ 1 
which is non negative in H for each W~ l = X > 0. These inequalities are 
also equivalent to LMIs which we do not write down. 

10.4.4. Numerics and symbolics. A single Riccati inequality is much more 
special than an LMI and numerical solvers for Riccatis are faster and handle 
bigger matrices. This survey obviously has not aimed at numerics, but at 
algebraic precursors to using numerics. 

The mathematics here aims toward helping an engineer who writes a 
toolbox which other engineers will use for designing systems, like control 
systems. What goes in such toolboxes is algebraic formulas like the DGKF 
inequalities above with matrices A,B,C unspecified and reliable numerics 
for solving them when a user does specify A, B, C as matrices. A user who 
designs a controller for a helicopter puts in the mathematical systems model 
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for his helicopter and puts in matrices, for example, A is a particular i? 8x8 
matrix etc. Another user who designs a satellite controller might have a 
50 dimensional state space and of course would pick completely different 
A, B, C. Essentially any matrices of any compatible dimensions can occur 
and our claim that our algebraic formulas are convex in the ranges we specify 
must be true. 

The toolbox designer faces two completely different tasks. One is manip- 
ulation of algebraic inequalities; the other is numerical solutions. Often the 
first is far more daunting since the numerics is handled by some standard 
package. Thus there is a great need for algebraic theory. 

10.5. Engineers need generality. To make exposition palatable in this 
paper we have refrained from generality which does not have much effect on 
mathematical structure. However, to embrace linear systems problems we 
need more general theorems. A level of generality which most linear systems 
problems require is to work with polynomials p in two classes of variables 
p(a, x) where we shall be interested in matrix convexity in x over ranges of 
the variable a. Describing this setup fully takes a while, as one can see in 
[CHSY03] where it is worked out. An engineer might look at CHSprept|, 
especially the first part which describe a computational noncommutative 
algebra attack on convexity, it seems to be the most intuitive read on the 
subject at hand. Here we try to indicate the idea. In private notes of Helton 
and Adrian Lim one shows that second derivatives of p(a, x) in x determine 
convexity in x and that convexity in the x variable on some "open set" of 
a, x implies that p has degree 2 or less in x. From this we get 

If P(a,x) is a symmetric d x d matrix with polynomial entries pij(a,x), 
then convexity in x for all X and all A satisfying some strict algebraic in- 
equality of the form g(A) > 0, implies each pij has degree 2 or less. 

We obtain this from the following argument. We shall test P(a,x) by 
plugging in tuples A and X of n x n matrices for a and x. First note 
that matrix convexity of P in X through a range of A, X implies that the 
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diagonals pa must have this property. Thus they have degree 2 or less in 
x. Consider how the Hessian F(t) := P xx (A,tX)[H] in x scales with a 
scalar parameter t. The matrix function being convex implies its diagonals 
are convex. Thus as we saw above, F^it) is independent of t for all k. 
Apply F(t) to the vectors v = column (=ti>i, • • • , ifd) in R nxn and use that 
v T F(t)v > for all t, to get that for each i,j the entries F^ satisfy 

vjFii^Vi + vjFjj^Vj > ±{vjF ij (t)v j f. 

This implies by letting t — > oo that the degree of vfFij(t)vj in t is 0, which 
implies the same for Fij(t). To this point we have that all polynomials in 
P xx (a,x)[h] are independent of X whenever matrix tuples A from an open 
set {A : g{A) > 0} are plugged in. This is independent of the size n of the 
matrices we plug in, so all polynomials in P xx (a,x)[h] are 0, algebraically 
speaking. Thus all polynomials in P(a, x)[h] have degree 2 in x or less. The 
engineering conclusion from all of this is formulated below. 



10.6. Conclusion. 

(1) Many linear systems problems which are "dimension free" readily 
reduce to noncommuting inequalities on d x d matrices of polynomials 
of the form P(a, x) < 0. These do so as in the £ jl0.4l example, or 
even after simplifying solving and substituting they yield a matrix 
of polynomials. 

(2) If such P(A, X) is X-convex on the set of n x n matrix tuples A 
satisfying a strict polynomial inequality g{A) > and on all X 
(regardless of dimension n), then P(a,x) has degree 2 in x, as we 
saw in §10.51 Alas, P is surprisingly simple. 

(3) Assume a d x d matrix of polynomials P(a, x) has degree 2 in x. 
There are tests (not perfect) to see where in the a variable P(X, A) 
is negative semi-definite for all X. Equivalently, to see where P is 
convex in x. 
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(4) Convexity and the degree 2 property imply P(a, x) < can be ex- 
pressed as an LMI. Often the LMI can be constructed with coeffi- 
cients which are noncommutative polynomials (dimension free for- 
mula). See proof below. 

This very strong conclusion is bad news for engineers and we empha- 
size that it does not preclude transformation to convexity or convexity for 
dimension dependent problems. 

10.6.1. Tests for convexity and the making of an LMI. Here we shall sketch 
of the proof of Conclusions (|3|) and (j3|). We use methods not described 
earlier in this paper, but despite that restrict our presentation to be only a 
brief outline. For proofs in detail see [CHSY03] or more generally IHMPpos] . 

Suppose q(a)[h] is a symmetric polynomial in a, h which is homogeneous 
of degree 2 in h, then q being quadratic in h, can be represented as 

(10.5) q{a)[h] = V{a)[h] T M{a)V{a)[h] 

where M is matrix of noncommutative polynomials in a, and V is a vector 
each entry of which is a monomial of the form hjm(a) where m{a) is a 
monomial in a. We can choose the representation so that no monomial 
repeats. A key is Theorem 10.10 and Lemma 9.4 in [CHSY03] which imply 

Lemma 10.3. Let q(a,h),g(a) be polynomials in the free algebra with q 
purely quadratic inh. Thenq(A)[H] > for g(A) > and all H is equivalent 
to M(A) > for g{A) > 0. 

We shall apply this by representing P xx (a,x)[h] = V (a)[h] T M(a)V (a)[h}. 
The x- Hessian being quadratic in x satisfies P xx (a,x)[h] is independent of 
x. From the lemma we have P XX (A,X)[H] > for <?(^4) > and all X is 
equivalent to M(A) > for g{A) > 0. 

Two tests for positivity as mentioned in Conclusion (|3|) follow. 

(1) The test in [CHSY03] is: take the symbolic noncommutative M{a) = 
L(a) T D(a)L(a) decomposition of M(a). This gives D(a) a matrix 
with diagonal or 2 x 2 block diagonal entries which are nc rational 
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functions in a. M{A) > if and only if D{A) > 0, so checking posi- 
tivity of the functions on diagonal D{a) is a test for where p(a, x) is 
convex. 

(2) Here is another test. If the Positivstellensatz holds (despite a failure 
of the strict positivity hypothesis), then 

(10.6) M(a, i)GS 2 + ^ rjgrj + ^ tfj(C 2 - xjx^Uj. 

Computing the terms Tj,tij and the sums of squares component gives 
an algebraic test. 

Conversion to LMIs, namely, Conclusion Denote by P^(a,rc) the 

terms in P(a,x) with x degree exactly one, respectively P (a, x)with x 
degree exactly two. 

(1) We now give quite a practical numerical algorithm for producing an 
LMI, under no offensive assumptions. However, we do not get for- 
mulas which are polynomials in the symbol a. Once n x n matrices A 
are given with M(A) positive semi-definite we can compute numeri- 
cally its Cholesky decomposition M(A) = L(A) T D(A)^ D(A)% L(A) 
(actually any square root of M(A) will do). Then we have 

P n (A,X) = V{A)[X) T L{A) T D{A)^ D(A)?L(A)V(A)[X] 

and taking 



C(A)[X] :- 



p°(A,X) +p I (A,X) D(A)\L(A)V(A)[X] 
V{A)[X] T L{A) T D{A)^ -I 



which produces £(^4)[X] whose Schur complement equals P(A,X) 
and which produces a matrix inequality 

(10.7) {X : P(A,X) < 0} = {X : C(A)[X] < 0}. 

The entries of £(j4)[X] are linear in scalar unknowns Xi m and have 
nxn matrix coefficients. This is standard input to the LMI numerical 
solvers prevalent today. 
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(2) Another recipe which produces algebraic formulas for solution the 
following. Continue with item (2) above. The terms P n (a,x) in 
P(a, x) with x degree exactly two can be represented by M(a) as in 
(|10.5p . From the Positivstellensatz (|10.6p for M(a) and the fact that 
linear terms are trivial to handle, we can easily build an algebraic 
expression for a matrix £(a)[x] with polynomial entries which are 
linear in x whose Schur complement equals P(a,x). Moreover, for 
any fixed A satisfying g(A) > 0, the solution sets to our favorite 
matrix inequality P > and the LMI based on C are the same, as in 
(|10.7p . This completes the proof that, if the Positivstellensatz (|10.6p 
for P n (a,x) exists, then a LMI which is polynomial in a exists. 

As a side remark, for the degree 2 and other properties of matrix valued 
polynomials we could use weaker hypotheses allowing coupling of a and x 
(as in done in private Helton- Lim notes for polynomials), these probably 
work by the same argument, basically the argument in [HM04b]). 

10.7. Keep going. This subject of noncommutative real algebraic geome- 
try and its geometric offshoots on convexity is a child of the 21 s * century. 
Understanding the relationship between Convex Mis and LMIs was a core 
motivation for its developments. When we look at the two basic techniques 
in § (| 1 . 4 . 2 [) what we have done in this paper bears successfully on issue [2j 
But nothing has been said about issue [TJ Nick Slinglend's UCSD thesis in 
progress makes a start in that direction. 

This physical section has focused on "dimension free" problems. What 
about dimension dependent ones? In these problems the variables commute. 
There the behavior is quite different; as we saw in £J7] there is an extra 
constraint beyond convexity to have equivalence to an LMI. 

11. A GUIDE TO LITERATURE 

While classical semi-algebraic geometry has developed over the last cen- 
tury through an outpouring of seemingly countless papers, the thrust toward 
a noncommutative semi-algebraic geometry is sufficiently new that we have 
attempted to reference the majority of papers directly on the subject here 
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in this survey. This non-discriminating approach is not entirely good news 
for the student, so in this section we provide some guidance to the more 
readable references. 

The Functional Analysis book by Riesz and Nagy [RN90] is a class in 
itself. For a historical perspective on the evolution of the spectral theorem 
the reader can go directly to Hilbert's book |Hiibl953] or the German En- 
cyclopedia article by Hellinger and Toeplitz [HT53] . Reading von Neumann 
in original [vNl] is still very rewarding. 

The many facets of matrix positivity, as applied to function theory and 
systems theory, are well exposed in the books by Agler-McCarthy [AM02J, 
Foias-Frazho [F F90| and Rosenblum-Rovnyak [RR97]. The monograph of 
Constantinescu |Constantinescu96] is entirely devoted to the Schur algo- 
rithm. 

For the classical moment problem Akhiezer's text [Akh65j remains the 
basic reference, although having a look at Marcel Riesz original articles 
[MR23j, Carleman's quasi-analytic functions |C26j . or at the continued frac- 
tions monograph of Perron }Per50j might bring new insights. Good surveys 
of the multivariate moment problems are Berg |Bcrg87| and Fuglede [F83J. 
Reznick's memoir |R92j exploits in a novel and optimal way the duality 
between moments and positive polynomials. 

For real algebraic geometry, including the logical aspects of the theory, we 
refer to the well circulated texts [BCR98, J89[ IMOO] and the recent mono- 
graph by Prestel and Delzell [PDOlj ; the latter offers an elegant and full 
access to a wide selection of aspects of positive polynomials. For new re- 
sults in algorithmic real (commutative) algebra see |BPR03| ; all recent arti- 
cles of Lasserre contain generous recapitulations and reviews of past articles 
devoted to applications of sums of squares and moments to optimization. 
Scheiderer's very informative survey [S03J is centered on sums of squares de- 
compositions. Parrilo's thesis [ ParThesis] is a wonderful exposition of many 
new areas of application which he discovered. 

An account of one of the most systematic and elegant ways for produc- 
ing LMIs for engineering problems is the subject of the book [SIG97J. The 
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condensed version we heartily recommend is their 15 page paper [SI95J. 



Software: 

Common semi-definite programming packages are |Sturm 99] SeDuMi and 
LMI Toolbox |GNLC95j . 

Semi-algebraic geometry packages are SOS tools [PPSP04J and GloptiPoly 
[HL03] . 

For symbolic computation in a free *- algebra see NCAlgebra and NCGB 
(which requires Mathematica) [HSM05] . 

References 

[AM02] J. AGLER, J.E. McCARTHY, Pick interpolation and Hilbert function spaces. 
Graduate Studies in Mathematics, 44, Providence, RI: American Mathematical Soci- 
ety, 2002. 

[Akh65] N. I. AKHIEZER, The Classical Moment Problem, Oliver and Boyd, Edinburgh 
and London, 1965. 

[Amit57] S. A. AMITSUR, A generalization of Hubert's Nullstellensatz, Proc. Amer. 
Math. Soc. 8(1957), 649-656. 

[Art26] E. ARTIN, Uber die Zerlegung definiter Funktionen in Quadrate, Abh. math. Sem. 
Hamburg 5(1926), 100-115. 

[AS26] E. ARTIN, O. SCHREIER, Algebraische Konstruktion reeler Korper, Abh. math. 
Sem. Hamburg 5(1926), 85-99. 

[BGM05] J. A. BALL, T. MALAKORN, G. GROENEWALD, Structured noncommuta- 
tive multidimensional linear systems, SIAM J. Control and Optimization 44 (2005), 
no. 4, 1474-1528. 

[BT98] J. A. BALL, T.T. TRENT, Unitary colligations, reproducing kernel Hilbert spaces, 
and Nevanlinna-Pick interpolation in several variables, J. Funct. Anal. 157(1998), 
1-61. 

[BPR03] S. BASU, R. POLLACK, M.-F. ROY, Algorithms in real algebraic geometry, 
Algorithms and Computation in Mathematics, 10, Springer- Verlag, Berlin, 2003. 

[B01] C. L. BECK, On formal power series representations of uncertain systems IEEE 
TAC, 46, no. 2, 2001, 314-319. 

[Berg87] C. BERG, The multidimensional moment problem and semigroups. Moments in 
mathematics (San Antonio, Tex., 1987), Proc. Sympos. Appl. Math. ,37, Amer. Math. 
Soc, Providence, RI, 1987. pp. 110-124. 



96 



J. WILLIAM HELTON AND MIHAI PUTINAR 



[Berg91] C. BERG, M. THILL, Rotation invariant moment problems, Acta Math. 167 
(1991), no. 3-4, 207-227. 

[Berg96] C. BERG, Moment problems and polynomial approximation. 100 ans apres Th.- 
J. Stieltjes. Ann. Fac. Sci. Toulouse Math. (6) 1996, Special issue, 9-32. 

[Bert05] D. BERTSIMAS, I. POPESCU, Optimal inequalities in probability theory: a 
convex optimization approach, SIAM J. Optim. 15 (2005), no. 3, 780-804. 

[Blec04] G. BLEKHERMAN, There are significantly more nonnegative polynomials than 



[BCR98] J. BOCHNACK, M. COSTE, M-F. ROY, Real algebraic geometry, Translated 
from the 1987 French original. Revised by the authors, Ergebnisse der Mathematik 
und ihrer Grenzgebiete (3) [Results in Mathematics and Related Areas (3)], 36. 
Springer- Verlag, Berlin, 1998. 

[CHSY03] J. F CAMINO, J. W. HELTON, R. E. SKELTON, J. YE, Matrix inequalities: 
A Symbolic Procedure to Determine Convexity Automatically, Integral Equations and 
Operator Theory46(2003), 399-454. 

[CHSprept] J. F. CAMINO, J. W. HELTON and R.E. SKELTON, Solving Matrix In- 
equalities whose Unknowns are Matrices to appear SIAM Journal of Optimization 

[C26] T. CARLEMAN, Les Fonctions Quasi- Analytiques, Gauthier Villars, Paris, 1926. 

[Cas84] G. CASSIER, Probleme des moments sur un compact de R n et decomposition de 
polynomes a plusieurs variables, J. Funct. Anal. 58(1984), 254-266. 

[CD96] D.W. CATLIN, J.P. D'ANGELO, A stabilization theorem for Hermitian forms 
and applications to holomorphic mappings, Math. Res. Lett. 3(1996), 149-166. 

[CD99] D.W. CATLIN, J.P. D'ANGELO, An isometric imbedding theorem for holomor- 
phic bundles, Math. Res. Lett. 6(1999), 43-60. 

[Cim 00] J. CIMPRIC, Archimedean preorderings in non- commutative semi-algebraic ge- 
ometry, Comm. Algebra bf 28 (2000), 1603-1614. 

[CKSprept] J. CIMPRIC, S. KUHLMANN, C. SCHEIDERER, The Invariant Moment 
Problem, preprint (2005). 

[CW99] J.B. COLE, J. WERMER, Ando's theorem and sums of squares, Indiana Univ. 
Math. J. 48(1999), 767-791. 

[Connes76] A. CONNES, Classification of injective factors. Cases IIi,II 00 ,III\,\ 7^ 1, 
Ann. Math. 104(1976), 73-115. 

[Constantinescu96] T. CONSTANTINESCU, Schur parameters, factorization and dilation 
problems, Operator Theory: Advances and Applications, 82 , Birkhuser Verlag, Basel, 




1996. 



POSITIVE POLYNOMIALS AND THE SPECTRAL THEOREM 



97 



[CF05] R.E. CURTO, L.A. FIALKOW, Truncated K-moment problems in several vari- 
ables, J. Operator Theory 54 (2005), 189-226. 

[d'A05] J. D'ANGELO, Complex variables analogues of Hubert's seventeenth problem Int. 
J. Math. 16, No. 6 (2005), 609-627. 

[d'AV03] J. D'ANGELO, D. VAROLIN, PosiUvity conditions for Hermitian symmetric 
functions, Asian J. Math. 7(2003), 1-18. 

[Dem02] O.DEMANZE, Probleme des moments multi-dimentionnel et sous-normalite 
jointe, These de doctorat., Univ. Lille I, 2002. 

[DGKF89] J. C. DOYLE, K. GLOVER, P. P. KHARGONEKAR, B. A. FRANCIS, State- 
space solutions to standard H2 and H x control problems. IEEE Trans. Automat. 
Control 34 (1989), 831-847. 

[D03] R.G. DOUGLAS, Banach algebra techniques in the theory of Toeplitz operators, 
Springer, Berlin, 2003. 

[Dub69] D.W. DUBOIS, A nullstellensatz for ordered fields, Ark. Mat. 8(1969), 111-114. 

[DHMprept] H. DYM, J. W. HELTON, S. A. MCCULLOUGH, The Hessian of a Non- 
commutative Polynomial has Numerous Negative Eigenvalues, preprint, pl-48. 

[E36] M. EIDELHEIT, Zur Theorie der konvexen Mengen in linearen normierten 
Rdumen, Studia Math. 6(1936), 104-111. 

[EP02] J. ESCHMEIER, M. PUTIN AR, Spherical contractions and interpolation problems 
on the unit ball, J. Reine Angew. Math. 542(2002), 219-236. 

[FF90] C. FOIAS, A.E. FRAZHO, The commutant lifting approach to interpolation prob- 
lems, Operator Theory: Advances and Applications, 44(1990), Birkhauser, Basel. 

[F83] B. FUGLEDE, The multidimensional moment problem, Expo. Math. 1, 47-65 
(1983). 

[GNLC95] P. GAHINET, A. NEMIROVSKII, A.J. LAUB, M. CHILALI, LMI Control 
Toolbox, The Math Works, Inc., USA, 1995. 

[GP04] K. GATERMANN; P. A. PARRILO, Symmetry groups, semi-definite programs, 
and sums of squares. J. Pure Appl. Algebra 192 (2004), 95-128. 

[GV61] I.M. GELFAND, N.I. VILENKIN, Generalized Functions, Vol. IV, (in Russian), 
Gos. Iz. Fiz.-Mat., Moscow, 1961. 

[HW93] G.H. HARDY, E.M. WRIGHT, An Introduction to the Theory of Numbers, 
Clarendon Press, Oxford, 1993. 

[HanT06] F. HANSEN, J. TOMIYAMA Differential analysis of matrix convex functions, 
Linear Algebra and its Applications (2006) 



98 



J. WILLIAM HELTON AND MIHAI PUTINAR 



[Han97] F. HANSEN, Operator convex functions of several variables, Publ. RIMS, Kyoto 
Univ. 33 (1997), 443-464 

[HT53] E. HELLINGER, O. TOEPLITZ, Integralgleichmgen und Gleichungen mit un- 
endlichvielen Unbekannten, Chelsea Publ., New York, 1953. 

[H02] J.W.HELTON, "Positive" noncommutative polynomials are sums of squares., Ann. 
of Math. (2) 156 (2002), 675-694. 

[H03] J. W. HELTON, Manipulating Matrix Inequalities Autmatically, volume of Plenary 
Talks at the conference "Mathematical Theory of Networks and Systems" 2002, Inst. 
Math. Analysis Series on Math and Appl., vol. 134 pp 237-257, Springer, Berlin, 
2003. 

[HM04a] J.W.HELTON, S. McCULLOUGH, A Positivstellensatz for non- commutative 
polynomials., Trans. Amer. Math. Soc. 356 (2004), 3721-3737. 

[HM04b] J.W.HELTON, S. McCULLOUGH, Convex noncommutative polynomials have 
degree two or less, SIAM J. Matrix Anal. Appl. 25 (2004), 1124-1139. 

[HMP04a] J.W.HELTON, S. McCULLOUGH, M. PUTINAR, A non-commutative Posi- 
tivstellensatz on isometries., J. Reine Angew. Math. 568 (2004), 71-80. 

[HMP04b] J.W.HELTON, S. McCULLOUGH, M. PUTINAR, Non-negative hereditary 
polynomials in a free *-algebra, Math. Zeitschrift 250(2005), 515-522. 

[HMPpos] J.W.HELTON, S. McCULLOUGH, M. PUTINAR, Matrix Representations of 
Positive Noncommutative Polynomials, Positivity 10(2006), 145-163. 

[HMP06] J.W.HELTON, S. McCULLOUGH, M. PUTINAR, Strong majorization in a 
free *-algebra, Math. Zeitschrift, to appear. 

[HSM05] J. W. HELTON, M. STANKUS, R.L. MILLER, NCAlgebra and NCGB 2005 
Release, 

\protect\vrule widthOpt\protect\href {http: //math.ucsd. edu\string~ncalg}{http : //math.ucsd. edu$\sim: 

[HMVprept] J. W. HELTON, S. A. MCCULLOUGH, V. VINNIKOV Noncommutative 
convexity arises from Linear Matrix Inequalities, pp 1- 85, to appear J. Functional 
Analysis. 

[HVprept] J. W. HELTON, V. VINNIKOV, Linear Matrix Inequality Representation of 
Sets, prepint http:// arXiv.org posted June 2003, pp. 1-22, to appear in Comm. 
Pure and Appl. Math. 

[HL03] D. HENRION, J.-B. LASSERRE, GloptiPoly: global optimization over polynomi- 
als with Matlab and SeDuML, ACM Trans. Math. Software 29 (2003), no. 2, 165-194. 



POSITIVE POLYNOMIALS AND THE SPECTRAL THEOREM 



99 



[HL05] D. HENRION, J.-B. LASSERRE, Detecting global optimality and extracting so- 
lutions in GloptiPoly, Positive polynomials in control, Lecture Notes in Control and 
Inform. Sci.,312, Springer, Berlin, 2005, pp. 293-310. 

[HL06] D. HENRION, J.-B. LASSERRE, Convergent relaxations of polynomial matrix 
inequalities and static output feedback, IEEE Trans. Automatic Control 51(2006), 
192- 202. 

[Hilbl888] D. HILBERT, Uber die Darstellung definiter formen als Sumraen von Formen- 
quadraten, Math. Ann. 32(1888), 342-350. 

[Hilbl890] D. HILBERT, Mathematische Probleme, Gottinger Nachr. (1990), 253-297.. 

[Hilbl910] D. HILBERT, Hermann Minkowski, Math. Ann. 68(1910), 445-471. 

[Hilbl953] D. HILBERT, Grundziige einer allgemeiner Theorie der Linearen Integralgle- 
ichungen, Chelsea Publ., New York, 1953. 

[HSch95] C. W. HOL, C. W. SCHERER, A sum-of-squares approach to fixed-order H^- 
synthesis, Positive polynomials in control, Lecture Notes in Control and Inform. Sci., 
312, Springer, Berlin, 2005, pp. 45-71. 

[Horm83] L. HORMANDER, The Analysis of Linear Partial Differential Operators. II: 
Differential Operators with Constant Coefficients, Springer, Berlin, 1983. 

[J89] N. JACOBSON, Basic Algebra. II, Freeman, San Francisco, 1989. 

[K37] S. KAKUTANI, Ein Beweis des Satzes von M. Eidelheit uber konvexe Mengen, 
Proc. Imp. Acad. Tokyo 13(1937), 93-94. 

[K05] D.S. KALYUZHNYI-VERBOVETZKI, Caratheodory interpolation on the non- 
commutative polydisk, J. Funct. Anal. 229 (2005), 241-276. 

[KS05] I. KLEP, M. SCHWEIGHOFER, A Nichtnegtaivstellensatz for polynomials in non- 
commuting variables, Israel J. Math., to appear. 

[KS06] I. KLEP, M. SCHWEIGHOFER, Connes' embedding conjecture and sums of her- 
mitian squares, preprint 2006. 

[KM70] A. G. KOSTYUCENKO, B. S. MITYAGIN, Positive- definite functional on nu- 
clear spaces. Trudy Moskov Mat. Obsc. (in Russian) 9, 283-316 (1960); English trans- 
lation in Amer. Math. Soc. Transl. (ser. 2) 93(1970), 1-43. 

[K69] G. KOTHE, Topological Vector Spaces. I, Springer, Berlin, 1969. 

[KN81] M.G. KREIN, M.A. NAIMARK, The method of symmetric and Hermitian forms 
in the theory of separation of the roots of algebraic equations, (Translated from the 
Russian by O. Boshko and J. L. Howland), Linear Multilinear Algebra 10(1981), 
265-308. 

[K36] F. KRAUS, "Uber Konvexe Matrixfunctionen," Math. Zeit. 41 (1936) 18 - 42. 



100 



J. WILLIAM HELTON AND MIHAI PUTINAR 



[K64] J. -L. KRIVINE, Anneaux prior donnes, J. Analyse Math. 12(1964), 307-326. 

[KM02] S. KUHLMANN, M. MARSHALL, Positivity, sums of squares and the multi- 
dimensional moment problem, Trans. Amer. Math. Soc. 354(2002), 4285-4301. 

[KMS05] S. KUHLMANN, M. MARSHALL, N. SCHWARTZ , Positivity, sums of squares 
and the multi- dimensional moment problem. II, Adv. Geom. 5 (2005), 583-606. 

[LOO] J. B. LASSERRE, Optimisation globale et theorie des moments, C. R. Acad. Sci. 
Paris 331 Serie 1, 929-934. 

[L01] J.B. LASSERRE, Global optimization with polynomials and the problem of moments, 
SIAM J. Optim. 11(2001), 796-817. 

[LN06] J. B. LASSERRE, T. NETZER, SOS approximations of non-negative polynomials 
via simple high degree perturbations, Math. Z., to appear. 

[L04] J.B. LASSERRE, T. PRIETO-RUMEAU, SDP vs. LP relaxations for the moment 
approach in some performance evaluation problems, Stoch. Models 20 (2004), no. 4, 
439-456. 

[LPR05] A. S. LEWIS, P. A. PARRILO, M. V. RAMANA, The Lax conjecture is true, 
Proc. Amer. Math. Soc. 133 (2005), no. 9, 2495-2499. 

[MOO] M. MARSAHALL, Positive Polynomials and Sums of Squares, Instituti Edit. 
Poligraf. Int. Pisa, Roma, 2000. 

[M03] J. E. MCCARTHY, Pick's theorem—what's the big deal? Amer. Math. Monthly 
110 (2003), 36-45. 

[M86] H. MINKOWSKI, Untersuchungen iiber quadratische formen, Acta Math. 7(1886), 
201-256. 

[N59] E. NELSON, Analytic vectors, Ann. Math. 70(1959), 572-615. 

[NN94] Y. NESTEROV, A. NEMIROVSKII, Interior Point Polynomial Algorithms in 
Convex Programming, SIAM Studies 13, 1994. 

[vNl] J. von NEUMANN, Algemeine Eigenwerttheorie Hermitischer Funktionalopera- 
toren, Math. Ann. 102(1929), 49-131. 

[vN2] J. von NEUMANN, Eine Spektraltheorie fur allgemeine Operatoren eines unitaren 
Raumes, Math. Nachr. 4 (1951), 258-281. 

[OSTprept] I. OSAKA, S. SILVESTEROV and J. TOMIYAMA, Monotone operator 
functions, gaps and the power moment problem March 1, 2006 peprint pp32. 

[NSprept] J. NIE, M. SCHWEIGHOFER, On the complexity of Putmar's Positivstellen- 
satz, preprint 2005. 



POSITIVE POLYNOMIALS AND THE SPECTRAL THEOREM 



101 



[ParThesis] P.A. PARRILO, Structured bsemidefinite programs and semi- algebraic geome- 
try methods in robustness and optimization, PhD Thesis, California Inst. Technology, 
Pasadena, 2000. 

[PS03] P.A. PARRILO, B. STURMFELS, Minimizing polynomial functions, Algorithmic 
and quantitative real algebraic geometry (Piscataway, NJ, 2001), pp. 83-99, DIMACS 
Ser. Discrete Math. Theoret. Comput. Sci. 60, Amer. Math. Soc, Providence, RI, 
2003. 

[1] P. A. PARRILO Exploiting algebraic structure in sum of squares programs. Positive 
polynomials in control, 181-194, Lecture Notes in Control and Inform. Sci., 312, 
Springer, Berlin, 2005. 

[PPSP04] S. PRAJNA, A. PAPACHRISTODOULOU, P. SEUILER, P. A. PARRILO, 
SOSTOOLS and its control applications. Positive polynomials in control, pp. 273- 
292, Lecture Notes in Control and Inform. Sci. 312, Springer, Berlin, 2005. 
Download Software Package: SOSTools, Cal Tech Oct 2004, 

{http : //www . cds . caltech . edu/ sostools} 

[PPR04] S. PRAJNA, P. A. PARRILO, A. RANTZER, Nonlinear control synthesis by 
convex optimization. IEEE Trans. Automat. Control 49 (2004), 310-314. 

[Per50] O. PERRON, Die Lehre von den Kettenbriichen, Zweite verbesserte Auflagc, 
Chelsea Publ. Comp. (reprint), New York, 1950. 

[PS25] G. POLYA, G. SZEGO, Aufgaben und Lehrsatze aus der Analysis, 2 vols., Julius 
Springer, Berlin, 1925. 

[PR01] V. POWERS, B. REZNICK, A new bound for Polya's theorem with applications 
to polynomials positive on polyhedra, Effective methods in algebraic geometry (Bath, 
2000), J. Pure Appl. Algebra 164 (2001), 221-229. 

[PPP02] S. PRAJNA, A. PAPACHRISTODOULOU, P. A. PARRILO, SOSTOOLS, 
Sums of squares optimization toolbox for MATLAB, California Inst. Technology, 
Pasadena, 2002. 

[PD01] A. PRESTEL, C.N. DELZELL, Positive polynomials. From Hubert's 17th problem 
to real algebra, Springer Monographs in Mathematics, Springer, Berlin, 2001. 

[P05] A. PRESTEL, Representation of real commutative rings, Expo. Math. 23(2005), 
89-98. 

[PS76] C. PROCESI, M. SCHACHER, A non- commutative real Nullstellensatz and 
Hilbert's 17th problem, Ann. of Math. (2) 104 (1976), 395-406. 

[P92] M. PUTIN AR, Sur la complexification du probleme des moments, C. R. Acad. Sci., 
Paris, Serie I 314(1992), No. 10, 743-745. 



102 



J. WILLIAM HELTON AND MIHAI PUTINAR 



[P93] M. PUTINAR, Positive polynomials on compact semi-algebraic sets, Indiana Univ. 
Math. J. 42(1993), 969-984. 

[P06] M. PUTINAR, On hermitian polynomial optimization, Arch. Math. 87(2006), 41- 
51. 

[Q68] D.G. QUILLEN, On the representation of hermitian forms as sums of squares, 
Invent. Math. 5(1968), 237-242. 

[Radul04] F. RADULESCU, A non- commutative, analytic version of Hubert's 17-th prob- 
lem in type Hi von Neumann algebras, preprint 2004. 

[R92] B. REZNICK, Sums of even powers of real linear forms, Mem. Amer. Math. Soc. 
96 (1992), Providence, R.I. 

[R95] B. REZNICK, Uniform denominators in Hubert's seventeenth problem, Math. Z. 
220 (1995), 75-97. 

[R13] F. RIESZ, Les systemes d'equations lineaires a une infinite d'inconnues, Gauthier- 
Villars, Paris, 1913. 

[RN90] F. RIESZ, B. SZ.-NAGY, Functional analysis, Transl. from the 2nd French ed. by 
Leo F. Boron. Reprint of the 1955 orig. publ. by Ungar Publ. Co., Dover Books on 
Advanced Mathematics, Dover Publications, Inc., New York, 1990. 

[MR23] M. RIESZ, Sur le probleme des moments. Troisieme Note, Ark. Mat. Fys. 
16(1923), 1-52. 

[RR97] M. ROSENBLUM, J. ROVNYAK, Hardy classes and operator theory, Corrected 
reprint of the 1985 original, Dover Publications, Inc., Mineola, NY, 1997. 

[S05] C. SCHEIDERER, Non-existence of degree bounds for weighted sums of squares 
representations, J. Complexity 21 (2005), 823-844. 

[S03] C. SCHEIDERER, Positivity and sums of squares: A guide to some recent results, 
Preprint, 2003. 

[S91] K. SCHMUDGEN, The K -moment problem for compact semi-algebraic sets., Math. 
Ann. 289 (1991), 203-206. 

[S05] K. SCHMUDGEN, A strict Positivstellensatz for the Weyl algebra, Math. Ann. 331 
(2005), 779-794. 

[S18] I. SCHUR, Uber Potenzreihen, die im Innern des Einheitskreises beschrankt sind.I, 
II, J. Reine Angew. Math. 147(1917), 205-232; ibidem 148(1918), 122-145. 

[S04] M. SCHWEIGHOFER, On the complexity of Schmiidgen's positivstellensatz, J. Com- 
plexity 20 (2004), no. 4, 529-543. 

[S05] M. SCHWEIGHOFER, Optimization of polynomials on compact semi-algebraic sets, 
SIAM J. Optim. 15 (2005), no. 3, 805-825. 



POSITIVE POLYNOMIALS AND THE SPECTRAL THEOREM 



103 



[S54] A. SEIDENBERG, A new decision method for elementary algebra, Ann. Math. 
60(1954), 365-374. 

[ST43] J. A. SHOHAT, J. D. TAMARKIN, The Problem of Moments, Amer. Math. Soc, 
Providence, R.I., 1943. 

[S87] N.Z.SHOR, Class of global minimum bounds for polynomial functions, Cybernetics 
23(1987), 731-734. 

[SIG97] R. E. SKELTON, T. IWASAKI, K. M. GRIGORIADIS, A Unified Algebraic Ap- 
proach to Linear Control Design, Taylor & Francis, San Francisco, 1997. 

[SI95] R. E. SKELTON, T. IWASAKI, Eye on Education: Increased Roles of Linear 
Algebra in Control Education, IEEE Control Systems Magazine, Vol. 15, No. 4, pp. 
76-90, August 1995. 

[S74] G. STENGLE, A Nullstellensatz and a Positivstellensatz in semi- algebraic geometry, 
Math. Ann. 207(1974), 87-97. 

[Sturm99] J. F. STURM, Using SeDuMi 1.02, a MATLAB Toolbox for Optimization over 
Symmetric Cones, j-OMS, 1999, 11/12, 1-4, 625-653. 

[SNF67] B. SZ.-NAGY, C. FOIAS, Analyse harmonique des operateurs de I'espace de 
Hilbert, Budapest: Akademiai Kiado; Paris: Masson et Cie, 1967. 

[Tak02] M. TAKESAKI, Theory of operator algebras. I, Reprint of the first (1979) edi- 
tion, Encyclopaedia of Mathematical Sciences, 124, Operator Algebras and Non- 
commutative Geometry, 5, Springer- Verlag, Berlin, 2002. 

[T86] A. TARSKI, Collected papers. Vol. 3. 1945-1957, Edited by Steven R. Givant and 
Ralph N. McKenzie, Contemporary Mathematicians, Birkhauser Verlag, Basel, 1986. 

[U02] M. UCHIYAMA, Operator monotone functions and operator inequalities, Sugaku 
Expositions 18 (2005), no. 1, 39-52. 

[Val79] L. VALIENT, Completeness classes in algebra, Proc. 11th ACM Annual Symp. 
on the Theory of Computing, pp. 249-261, 1979. 

[Var06] D. VAROLIN, Geometry of Hermitian algebraic functions. Quotients of squared 
norms, preprint 2006. 

Department of Mathematics, University of California at San Diego, La 
Jolla CA 92093 

E-mail address: helton@math.ucsd.edu 

Department of Mathematics, University of California, Santa Barbara, CA 
93106 

E-mail address: mputinar@math.ucsb.edu 



